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(54) Title: FREE REACTANT USE IN NUCLEIC ACID-TEMPLATED SYNTHESIS 



\^ (57) Abstract: The present invention provides methods and compositions for expanding the scope of chemical reactions that can 
be performed during nucleic acid-templated organic syntheses. In particular, nucleic acid-templated chemistries are used to pro- 
duce reaction intermediates attached to an oligonucleotide that can be used to identify the reaction intermediates and/or the resulting 
reaction products. The reaction intermediates then are reacted with free reactants (for example, reactants that are difficult or imprac- 
tical to couple to an oligonucleotide) to produce a reaction product. This approach expands the scope of reagents useful in nucleic 
acid-templated syntheses to reagents that do not need to be or cannot be tethered to an oligonucleotide. The reagents, however, still 
permit the synthesis of reaction products attached to oligonucleotides that can be used to identify the reaction products. 
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FREE REACTANT USE IN NUCLEIC ACID-TEMPLA TED SYNTHESIS 



RELATED APPLICATIONS 

[0001] This application claims the benefit of and priority to U.S. Patent Application Serial 
No, 60/646,584, filed January 21, 2005, the entire disclosure of which is incorporated by 
reference herein. 

GOVERNMENT FUNDING 

[0002] The research described in this application was sponsored, in part, by the Office of 
Naval Research under Contract No. N000 14-03-1 -0749 and by the NIH/NIGMS under grant R01 
GM065865. The United States Government may have certain rights in the invention. 

FIELD OF THE INVENTION 

[0003] The invention relates generally to methods and compositions for performing nucleic 
acid-templated synthesis. More particularly, the invention relates to methods and compositions 
for performing nucleic acid-templated synthesis to produce reaction intermediates, which can 
1 5 then be chemically transformed into reaction products using free reactants that react with the 
reaction intermediates to produce the reaction products, 

BACKGROUND OF THE INVENTION 

[0004] Nucleic acid-templated organic synthesis enables modes of controlling reactivity that 
are not possible in a conventional synthesis format and allows synthetic molecules to be 

20 manipulated using translation, selection, and amplification methods previously available only to 
biological macromolecutes (Gartner et aL (2001) J. Am, Chem. Soc. 123: 6961-3; Gartner et aL 
(2002) Angew. Chem., Int. Ed. Engl. 123: 61 796-1 800; Gartner et aL (2002) J. Am. Chem. 
Soc. 124: 10304-6; Calderone et aL (2002) Angew. Chem., Int. Ed. Engl. 41: 4104-8; Gartner 
et aL (2003) Angew. Chem., Int. Ed. Engl. 42: 1370-5; Li et aL (2004) J. Am. Chem. Soc. 124: 

25 5090-2; Kanan et aL (2004) NATURE 431: 545-9; Gartner et aL (2004) SCIENCE 305: 1601-5; Li 
et aL (2004) ANGEW. CHEM. Int. Ed. 43: 4848-70; Brenner et aL (1992) Proc. Natl. Acad. Sci, 
USA 89: 5181; Doyon et aL (2003) J. Am. Chem. Soc. 125: 12372-3; Halpin et aL (2004) PLoS 
Biol. 2: el 74). The structures that can be accessed through nucleic acid-templated synthesis, in 
particular, DNA-templated organic synthesis, or DTS, have been limited predominantly to 
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products of coupling reactions between two nucleic acid-linked reactants. In some cases, 
however, reactants are difficult or impossible to tether to an oligonucleotide. The development 
of strategies that enable non-oligonucleotide linked small-molecule reagents to react in a 
sequence-programmed or sequence-recorded manner, therefore, would significantly expand the 
5 synthetic capabilities of nucleic acid-templated synthesis. 

SUMMARY OF THE INVENTION 

[0005] The present invention provides methods and compositions for expanding the scope of 
nucleic acid-templated organic syntheses by addressing the need for reagents to be tethered to 
oligonucleotides. When the linkage of reagents to a nucleic acid, for example, DNA, is not 
10 possible or convenient, these transformations allow such reagents to nevertheless contribute to 
small molecule syntheses while preserving the correspondence between nucleic acid sequence 
and the structure of the product. In addition, by decoupling the nucleic acid-templated step from 
the coupling reaction, this approach allows bond formation to take place under conditions that do 
not necessarily support nucleic acid hybridization. 

15 [0006] In one aspect, the invention provides a method of synthesizing a reaction product. 

The method comprises the steps of: (a) providing a mixture comprising a first reactive unit and a 
second reactive unit under conditions to permit a reaction between the first and second reactive 
units to form a reaction intermediate; (b) providing an oligonucleotide comprising an identifying 
sequence attached to the reaction intermediate; and (c) combining the reaction intermediate with 

20 a free reactant selectively reactive with the reaction intermediate thereby to produce a reaction 
product linked to the identifying sequence. In this approach, the free reactant is more reactive 
with the reaction intermediate than with either of the reactive units in the starting mixture. 

[0007] In another aspect, the invention provides a method of synthesizing a reaction product 
via nucleic acid-templated synthesis as described, for example, in U.S. Patent Application Serial 

25 Number 10/643,752, which published under U.S. Patent Application Publication Number 

US2004/0 180412. The method comprises the steps of (a) providing a mixture comprising (i) a 
first reactive unit attached to a first oligonucleotide comprising a codon sequence, and (ii) a 
second reactive unit attached to a second oligonucleotide comprising an anti-codon sequence 
complementary to the codon sequence; (b) annealing the codon sequence of the first 

30 oligonucleotide with the anti-codon sequence of the second oligonucleotide to induce a reaction 
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betwedn the first and second reactive units to form a reaction intermediate attached at least to the 
first oligonucleotide; and (c) combining the reaction intermediate with a free reactant selectively 
reactive with the reaction intermediate thereby to synthesize a reaction product still attached to 
the first oligonucleotide sequence. The free reactant preferably is more reactive with the reaction 
intermediate than with at least one of the reactive units in the starting mixture. 

[0008] Similarly, to the extent that multiple different first reactive units (and optionally 
second reactive units) are present in the initial reaction mixture, it is possible that, under certain 
reaction conditions, multiple different reaction intermediates may be created. Accordingly, it 
may be advantageous for the free reactant to be selectively reactive with just one specific type of 
reaction intermediate in the mixture. Alternatively, it may be advantageous for the free reactant 
to be selectively reactive with a group or sub-group of reaction intermediates, where the reaction 
intermediates have a functional group with a particular chemical functionality. For example, the 
free reactant may be selectively reactive with reactive intermediates containing a free amine as 
compared to other reactive intermediates lacking a free amine. 

[0009] Furthermore, by knowing the codon and/or anticodon sequences it is possible to 
determine which second reactive unit reacted with the first reactive unit to produce the reactive 
intermediate and/or the reaction product. Furthermore, if the first oligonucleotide provides a 
sequence identifier for the first reactive unit attached to the first oligonucleotide, it is possible to 
determine what first reaction unit reacted with the second reactive unit to produce the reaction 
intermediate and/or the reaction product. Based upon the reaction conditions, the reactants 
present in a reaction mixture containing reaction intermediates, and information concerning 
when certain free reactants are added to the mixture containing reaction intermediates, it can be 
possible to determine what free reactant reacted with a reaction intermediate to create a specific 
reaction product. This information can be used to identify the reaction product and the reaction 
pathway by which it was made. 

[0010] The foregoing aspects and features of the invention may be further understood by 
reference to the following drawings, detailed description, examples, and claims. 
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DEFINITIONS 

[0011] The terms, "codon" and "anti-codon" as used herein, refer to complementary 
oligonucleotide sequences in a template and in a transfer unit, respectively, that permit the 
transfer unit to anneal to the template during nucleic acid-templated synthesis. 

5 [0012] The term, "free reactant" as used herein refers to a chemical reagent or chemical 
moiety that is not linked to an oligonucleotide that can participate in nucleic acid-templated 
synthesis. In comparison, the first and second reactive units and the transfer units are attached to 
oligonucleotides that can participate in nucleic acid-templated synthesis. 

[0013] The terms, "oligonucleotide" or "nucleic acid" as used herein refer to a polymer of 
10 nucleotides. The polymer may include, without limitation, natural nucleosides adenosine, 
thymidine, guanosine, cytidine, uridine, deoxyadenosine, deoxythymidine, deoxyguanosine, and 
deoxycytidine), nucleoside analogs {e.g., 2-aminoadenosine, 2-thiothymidine, inosine, pyrrolo- 
pyrimidine, 3-methyl adenosine, 5-methylcytidine, C5-bromouridine, C5-fluorouridine, 
C5-iodouridine, C5-propynyl-uridine, C5-propynyl-cytidine, C5-methylcytidine, 
15 7-deazaadenosine, 7-deazaguanosine, 8-oxoadenosine, 8-oxoguanosine, 0(6)-methylguanine, 
and 2-thiocytidine), chemically modified bases, biologically modified bases {e.g., methylated 
bases), intercalated bases, modified sugars {e.g., 2*-fluororibose, ribose, 2'-deoxyribose, 
arabinose, and hexose), or modified phosphate groups {e.g., phosphorothioates and 5' 
-N-phosphoramidite linkages). Nucleic acids and oligonucleotides may also include other 
20 polymers of bases having a modified backbone, such as a locked nucleic acid (LNA), a peptide 
nucleic acid (PNA), a threose nucleic acid (TNA) and any other polymers capable of serving as a 
template for an amplification reaction using an amplification technique, for example, a 
polymerase chain reaction, a ligase chain reaction, or non-enzymatic template-directed 
replication. 

25 [0014] The term, "reactive unit" as used herein, refers to a chemical reagent or chemical 
moiety (including, for example, but not limited to, a building block, monomer, monomer unit, 
small molecule scaffold, or other reactant useful in nucleic acid-templated chemical synthesis) 
that can participate in a chemical reaction with another chemical reagent or chemical moiety to 
produce a reaction intermediate and/or a reaction product. 
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[0015] The term, "reaction intermediate" as used herein, refers to a chemical reagent or a 
chemical moiety that can be chemically transformed into a different reagent or chemical moiety 
with a free reactant. 

[0016] The term, "small molecule" as used herein, refers to an organic compound either 
synthesized in the laboratory or found in nature having a molecular weight less than 10,000 
grams per mole, optionally less than 5,000 grams per mole, and optionally less than 2,000 grams 
per mole. 

[0017] The term, "small molecule scaffold" as used herein, refers to a chemical compound 
having at least one site or chemical moiety suitable for functionalization. The small molecule 
scaffold or molecular scaffold may have two, three, four, five or more sites or chemical moieties 
suitable for functionalization. These functionalization sites may be protected or masked as 
would be appreciated by one of skill in this art. The sites may also be found on an underlying 
ring structure or backbone. The small molecule scaffolds are not nucleic acids, nucleotides, or 
nucleotide analogs. 

[0018] The term, "transfer unit" as used herein, refers to a molecule comprising an 
oligonucleotide having an anti-codon sequence attached to a reactive unit including, for example, 
but not limited to, a building block, monomer, monomer unit, small molecule scaffold, or other 
reactant useful in nucleic acid-templated chemical synthesis. 

[0019] The term, "template" as used herein, refers to a molecule comprising an 
oligonucleotide having at least one codon sequence suitable for a nucleic acid-templated 
chemical synthesis. The template optionally may comprise (i) a plurality of codon sequences, 
(ii) an amplification means, for example, a PCR primer binding site or a sequence 
complementary thereto, (iii) a reactive unit associated therewith, (iv) a combination of (i) and 
(ii), (v) a combination of (i) and (iii), (vi) a combination of (ii) and (iii), or a combination of (i), 
(ii) and (iii). 

[0020] Throughout the description, where compositions are described as having, including, 
or comprising specific components, or where processes are described as having, including, or 
comprising specific process steps, it is contemplated that compositions of the present invention 
also consist essentially of, or consist of, the recited components, and that the processes of the 
present invention also consist essentially of, or consist of, the recited processing steps. Further, 
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it should be understood that the order of steps or order for performing certain actions are 
immaterial so long as the invention remains operable. Moreover, unless specified to the 
contrary, two or more steps or actions may be conducted simultaneously. 

DESCRIPTION OF THE DRAWINGS 

[0021] FIG. 1 is a schematic illustration of one aspect of the present invention in which a 
first reactive unit (FRU) and a second reactive unit (SRU) react to form a reaction intermediate 
(RI). The reaction intermediate (RI) is attached to an identifying sequence (IS). The RI-IS 
complex is combined with a free reactant (FR), which is selectively reactive with the RI to yield 
a reaction product (RP) linked to the IS. 

[0022] FIG. 2 is a schematic illustration of an aspect of the present invention, in which prior 
to the formation of the reaction intermediate (RI) the first reactive unit (FRU) and the second 
reactive unit (SRU) are linked to a codon sequence (CS) and a complementary anti-codon 
sequence (ACS), respectively. The CS and the ACS anneal to one another to permit the FRU 
and SRU to react with one another to produce the RI. The CS remains linked to the RI. The CS 
remains attached to the reaction product (RP) when the RI has been reacted with the free reactant 
(FR). 

[0023] FIG. 3 is a schematic illustration of an embodiment of the scheme shown in FIG, 1, 
in which, in a mixture of a population of first reactive units (FRU1-FRU4), at least one of the first 
reactive units (FRUi) reacts with a second reactive unit (SRUi) to form a reaction intermediate 
(RIi) which coexists with the population of first reactive units (FRU 2 -FRU 4 ). The reaction 
intermediate (RIi) is linked to an identifying sequence (IS). The reaction intermediate (RIi) 
linked to the IS and coexisting with the population of first reactive units (FRU 2 -FRU 4 ) then is 
permitted to react with a free reactant (FR), which is selectively reactive with RIi to yield a 
reaction product (RPi) linked to the IS. 

[0024] FIG. 4 is a schematic illustration of an embodiment of the scheme shown in FIG. 2, 
in which prior to the formation of the reaction intermediate (RIO the first reactive units (FRU r 
FRU 4 ) and the second reactive unit (SRU) are linked to codon sequences (CS) and a 
complementary anti-codon sequence (ACS), respectively. The CS remains linked to RI,. The 
CS remains attached to the reaction product (RP0 when RIj has been reacted with the free 
reactant (FR) to produce RPi . 
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[0025] FIG. 5A depicts DNA-templated transformation of azides into primary amines (top 
scheme), carboxylic acids (middle scheme), and thiols (bottom scheme). FIG, 5B depicts 
exemplary DNA-templated reactions using substrate azides 1-12, in which the listed yields 
represent lower limits. 

5 [0026] FIG- 6 A is a representation of a denaturing polyacrylamide gel electrophoresis 

(PAGE) gel following DNA-templated azide-to-amine transformation of azide 3 from FIG. 5B. 
Lane 1 contains azide-linked 30-mer template. Lane 2 contains azide-linked 30-mer template + 
carboxylic acid-linked 20-mer capture reagent + 30 mM EDC 4- 15 mM sulfo-NHS (showing no 
product formation). Lane 3 contains azide-linked 30-mer template + phosphine-linked 10-mer 

1 0 reagent (1 0~mer not visible). Lane 4 contains azide-linked 30-mer template + phosphine-linked 
10-mer reagent + carboxylic acid-linked 20-mer capture reagent + 30 mM EDC + 15 mM sulfo- 
NHS, the 50-mer secondary product arising from azide-to-amine transformation following by 
DNA-templated amine acylation is visible. Lane 5 contains azide-linked 30-mer template + 
phosphine-linked 10-mer reagent containing a mismatched sequence + carboxylic acid-linked 

15 20-mer capture reagent + 30 mM EDC 4- 15 mM sulfo-NHS. Lane 6 contains azide-linked 30- 
mer template 4- 5 mM TCEP-HC1 + carboxylic acid-linked 20-mer capture reagent + 30 mM 
EDC + 15 mM sulfo-NHS (positive control in which the azide is reduced in situ by TCEP). It 
appears that incomplete denaturing of the duplex between the 30-mer template and 20-mer 
capture reagent at the onset of electrophoresis results in band blurring (lanes 2 and 4-6). 

20 [0027] FIG. 6B is a representation of a denaturing PAGE gel following DNA-templated 
azide-to-amine transformation of azide 7 from FIG. SB. Lane 1 contains azide-linked 30-mer 
template + aldehyde-linked 20-mer capture reagent + 3 mM NaBH 3 CN. Lane 2 contains azide- 
linked 30-mer template + phosphine-linked 10-mer reagent + aldehyde-linked 20-mer capture 
reagent + 3 mM NaBH 3 CN. Lane 3 contains azide-linked 30-mer template + phosphine-linked 

25 1 0-mer reagent containing a mismatched sequence + aldehyde-linked 20-mer capture reagent + 3 
mM NaBH 3 CN. Slight 50-mer captured product formation is observed in lanes 1 and 3, which 
arises from slow spontaneous reduction of the phenyl azide during the preparation of a substrate- 
linked template and during the DNA-templated reactions. The background reactivity observed in 
lanes 1 and 3 (<13 %) was subtracted to determine the reported yield for lane 2. 

30 [0028] FIG. 6C is a representation of a denaturing PAGE gel following DNA-templated 

azide-to-carboxylic acid transformation of azide 8 from FIG. SB. Lane 1 contains azide-linked 



WO 2006/079061 



PCT/US2006/002420 



-8- 

30-mer template + amine-linked 20-mer capture reagent + 30 mM EDC + 15 mM sulfo-NHS. 
Lane 2 contains azi de-linked 30-mer template 4- phosphine-linked 10-mer reagent 4- amine- 
linked 20-mer capture reagent + 30 mM EDC 4-15 mM sulfo-NHS. Lane 3 contains azide- 
Iinked 30-mer template + phosphine-linked 10-mer reagent containing a mismatched sequence H- 
5 amine-linked 20-mer capture reagent + 30 mM EDC + 15 mM sulfo-NHS. 

[0029] FIG- 6D is a representation of a denaturing PAGE gel following DNA-templated 
azide-to-thiol transformation of azide 11 from FIG. 5B using a 10% poly aery lam ide gel. Lane 1 
contains azide-linked 30-mer template 4- alkyl bromide-linked 20-mer capture reagent. Lane 2 
contains azide-linked 30-mer template + phosphine-linked 10-mer reagent 4- alkyl bromide- 
10 linked 20-mer capture reagent. Lane 3 contains azide-linked 30-mer template 4- phosphine- 
linked 10-mer reagent containing a mismatched sequence 4- alkyl bromide-linked 20-mer capture 
reagent. 

[0030] FIG. 7 is a representative MALDI-TOF spectrum from a DNA-templated functional 
group transformation (in this case, the amine product arising from azide 1). 

15 [0031] FIG. 8 depicts a reaction of a single solution containing four azides with four non- 
DNA-linked small-molecule electrophiles to generate four sequence-programmed sulfonamide, 
carbamate, urea, and thiourea products. Template 13 is attached to an oligonucleotide having 
codon sequence a. The triphenylphosphine containing transfer unit 17 is attached to an 
oligonucleotide having anti-codon sequence a\ During templated synthesis codon sequence a 

20 anneals to anti-codon sequence a'. Similarly, templates 14, 15, and 16 contain codon sequences 
b, c, and d, respectively, which anneal to transfer units 18, 19, and 20 via anti-codon sequences 
b'y c\ and d', respectively. 

[0032] FIG. 9 is a schematic illustration showing the starting reagents used and reaction 
products created in FIG. 8. 

25 [0033] FIG. 10A through FIG. 10D are drawings of HPLC traces (monitored at 260 nm) 
following HPLC analysis of reactions of amine-linked templates with small-molecule reagents. 
SM indicates unreacted starting material amine-linked template peaks. PRD indicates 
derivatized products. Unless otherwise noted, peaks other than those labeled as SM or PRD do 
not correspond to DNA-linked species as judged by UV absorption at 230 nm and by MALDI- 

30 TOF analysis. FIG. 10A shows the results of the reaction of amine-linked template 13 with 
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dansyl chloride 21 (reagents 13 and 21 are shown in FIG. 8). FIG. 10B shows the results of the 
reaction of amine-linked template 14 with ethyl chloroformate 22 (reagents 14 and 22 are shown 
in FIG. 8). FIG. 10C shows the results of the reaction of amine-linked template 15 with 4- 
methoxy phenyl isocyanate 23 (reagents 15 and 23 are shown in FIG. 8). FIG. 10D shows the 
results of the reaction of amine-linked template 16 with 6-morpholino pyridinyl 3-methoxyl 
phenyl isocyanate 24 (reagents 16 and 24 are shown in FIG. 8), 

[0034] FIG. 11A shows a MALDI-TOF spectrum of the four azide starting materials in one 
solution (reagents 13-16 in FIG. 8). FIG. 11B shows a MALDI-TOF spectrum of the four 
sequence-specific transformation products (products 25-28 in FIG. 8) of the azide starting 
materials 13-16. 

DETAILED DESCRIPTION 

[0035] The present invention is useful in the synthesis of libraries of molecules, for example, 
small molecules. The functional group transformations described herein are particularly useful 
in expanding the scope of nucleic acid-templated organic syntheses by addressing the need for 
reagents to be tethered to oligonucleotides. When the linkage of reagents to an oligonucleotide 
is not possible or convenient, these transformations allow such reagents to nevertheless 
contribute to small molecule syntheses while preserving the correspondence between 
oligonucleotide sequence and resulting product structure. 

[0036] In one aspect, the invention provides a method of synthesizing a reaction product. 
The method comprises the steps of: (a) providing a mixture comprising a first reactive unit and a 
second reactive unit under conditions to permit a reaction between the first and second reactive 
units to form a reaction intermediate; (b) providing an oligonucleotide comprising an identifying 
sequence attached to the reaction intermediate; and (c) combining the reaction intermediate with 
a free reactant selectively reactive with the reaction intermediate, thereby to produce a reaction 
product attached to the identifying sequence. In this approach, the free reactant is more reactive 
with the reaction intermediate than with either of the reactive units in the starting mixture. 

[0037] This approach is shown schematically in FIG. 1. Briefly, a first reactive unit (FRU) 
is reacted with a second, different reactive unit (SRU) to produce a reaction intermediate (RI). 
The RI is attached, preferably, covalently attached, to an identifying sequence (IS). The IS can 
be an oligonucleotide (for example, DNA, or derivatives thereof, RNA, or derivatives thereof). 
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Then, the RMS complex is combined with a free reactant (FR) under conditions to permit the FR 
to chemically transform the RI into a reaction product (RP). The RP is still linked to the IS, 
which can be used to identify RP and the synthetic history of RP. 

[0038] In one approach, the IS, for example, a nucleic acid sequence defining a specific 
codon sequence or anti-codon sequence, is linked to the FRU prior to the reaction that produces 
the RI. The IS remains linked to the RI after the reaction so as to provide an IS linked to the RI. 
Following creation of the RP, the IS remains linked to the RP so that it is possible to identify the 
RP and its synthetic history. It is contemplated that the SRU may also be linked to a sequence 
complementary to the IS. As a result, during step (a), the IS hybridizes to the sequence 
complementary to the IS so as to bring the FRU and SRU into reactive proximity. 

[0039] In another approach, the IS, for example, a nucleic acid sequence defining a specific 
codon sequence or anti-codon sequence, is linked to the RI after it has been formed by the 
reaction between FRU and SRU. The RI can then be chemically transformed via the FR into RP. 
The IS remains linked to the RP. The IS can be linked enzymatically, for example, by a 
polymerase or ligase, to the RI after formation of the RL 

[0040] In one embodiment, the FR is at least five times more reactive with the RI than with 
at least one of, and optionally all of, the reactive units or other reactive intermediates in the 
starting mixture. Furthermore, depending on the reactants and reaction conditions, in other 
embodiments the FR is at least ten times, at least fifty times, at least one hundred times, at least 
two hundred fifty times, at least five hundred times, or at least one thousand times more reactive 
with the RI than with at least one of, and optionally all of, the reactive units or other reactive 
intermediates in the starting mixture. In addition, depending upon the reactants and reaction 
conditions, the RP is synthesized with a yield greater than or equal to 50%, greater than or equal 
to 75%, greater than or equal to 85%, or greater than or equal to 98%. 

[0041] The reactivity of the FR to the RI relative to the starting materials FRU and SRU can 
be determined experimentally. The amount of product produced by combining FR and RI under 
standard reaction conditions can be determined. The amount of product produced by combining 
in equimolar amounts FR with either FRU or SRU under the same reaction conditions can be 
determined. The yields of products can be determined by standard techniques in the chemical 
arts. Based on the relative amounts of product produced in each reaction it is possible to 
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determine whether the FR is more reactive, and, if so, how much more reactive, than the FRU or 
the SRU. Similar approaches can be used to determine whether the free reactant is more reactive 
with one reaction intermediate than with other, different reactive intermediates. 

[0042] In another aspect, the invention provides a method of synthesizing a reaction product 
via nucleic acid-templated synthesis as described, for example, in U.S. Patent Application 
Publication Number US2004/0180412. The method comprises the steps of (a) providing a 
mixture comprising (i) a first reactive unit attached to a first oligonucleotide comprising a codon 
sequence, and (ii) a second reactive unit attached to a second oligonucleotide comprising an anti- 
codon sequence complementary to the codon sequence, wherein the anti-codon sequence is 
indicative of the second reactive unit; (b) annealing the codon sequence of the first 
oligonucleotide with the anti-codon sequence of the second oligonucleotide to induce a reaction 
between the first and second reactive units to form a reaction intermediate attached to at least the 
first oligonucleotide; and (c) combining the reaction intermediate with a free reactant selectively 
reactive with the reaction intermediate thereby to synthesize a reaction product attached to the 
first oligonucleotide sequence. The free reactant preferably is more reactive with the reaction 
intermediate than with at least one of, and optionally all of, the reactive units or other reaction 
intermediates in the starting mixture. 

[0043] This approach is shown schematically in FIG. 2, where a first reactive unit (FRU) is 
attached, for example, covalently attached, to a first oligonucleotide comprising a codon 
sequence (CS). The combination of the first reactive unit and the oligonucleotide can be referred 
to as a template. The codon sequence may identify the FRU, for example, like an IS. 
Alternatively, the CS may further comprise a separate identifier sequence (IS) that identifies the 
FRU. In the latter scenario, the CS can identify the second reactive unit (SRU) that reacts with 
the FRU to create the reaction intermediate (RI) and the IS can identify the FRU. 

[0044] In this approach, the SRU is attached, for example, covalently attached to a second 
oligonucleotide that contains an anti-codon sequence (ACS) complementary to the CS. The 
combination of the second reactive unit with the anti-codon sequence can be referred to as a 
transfer unit. When the template and the transfer unit are combined under the appropriate 
reaction conditions, the CS and the ACS anneal to one another to bring the FRU and SRU into 
reactive proximity. The FRU and SRU then react with one another, for example, by proximity 
catalysis, to produce RI that is still linked to CS. When combined with the free reactant (FR), 
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the RI is chemically transformed by FR into a reaction product (RP) that is still linked to the CS. 
Assuming that the oligonucleotide attached to RP contains the CS, then it is possible to 
determine what SRU was involved in the synthesis of the RI and/or the RP. Similarly, if the 
oligonucleotide attached to RP contains the IS, then it is possible to determine what FRU was 
5 involved in the synthesis of the RI and/or RP. Accordingly, this information can be used to 
determine the identity and synthetic history of RI and/or RP, 

[0045] Furthermore, it is contemplated that the FRU can be a small molecule scaffold that 
can be used during nucleic acid-templated synthesis to produce a small molecule. In particular, 
the small molecule scaffold can be used as a core on which to assemble the substituents of the 
10 small molecule. 

[0046] In one embodiment, the FR is at least five times more reactive with the RI than with 
at least one of, and optionally all of, the reactive units or other reactive intermediates in the 
starting mixture. Furthermore, depending on the reactants and reaction conditions, in other 
embodiments the FR is at least ten times, at least fifty times, at least one hundred times, at least 
15 two hundred fifty times, at least five hundred times, or at least one thousand times more reactive 
with the RI than with at least one of, and optionally all of, the reactive units or other reactive 
intermediates in the starting mixture. In addition, depending upon the reactants and reaction 
conditions, the RP is synthesized with a yield greater than or equal to 50%, greater than or equal 
to 75%, greater than or equal to 85%, or greater than or equal to 98%. 

20 [0047] In another aspect, the invention provides a method of synthesizing a reaction product. 
The method comprises the steps of: (a) providing a mixture of a population of different first 
reactive units and a second reactive unit under conditions that induce a reaction between at least 
one of the first reactive units and the second reactive unit, thereby to form a reaction 
intermediate co-existing with the population of first reactive units; (b) providing an identifying 

25 sequence attached to the reaction intermediate, wherein the sequence distinguishes the reaction 
intermediate from the first reactive units; and combining the reaction intermediate co-existing 
with the first reactive units with a free reactant capable of selectively reacting with the reaction 
intermediate, thereby synthesizing a reaction product linked to an identifying sequence, the 
reaction product co-existing with the population of first reactive units. The free reactant is more 

30 reactive with the reaction intermediate than with at least one of the reactive units or other 
reactive intermediates in the starting mixture. 
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[0048] This approach is shown schematically in FIG. 3 and is similar to the approach shown 
schematically in FIG. 1 except that the first reaction unit is present as a mixture of first reactive 
units. Briefly, a starting mixture containing four first reactive units denoted FRUi, FRU 2 , FRU 3 , 
and FRU 4 are combined with a second reactive unit denoted as SRUi. Under the appropriate 
conditions, FRUi and SRUi react with one another to produce a reaction intermediate denoted as 
RIi. An identifier sequence (IS) can be attached to the RIj which identifies RIi. Thereafter, a 
free reactant (FR) is combined to the mixture under conditions for the RI to be chemically 
transformed into a reaction product denoted RPJ]. The RPi is still linked to the IS which can be 
used to identify RP and the synthetic history of RP. In addition, it is contemplated that RPi can 
be exposed to other rounds of functional group transformations, especially where the FRU is a 
small molecule scaffold, to produce further modified products. 

[0049] In another aspect, the invention provides a method of synthesizing a reaction product 
via nucleic acid-templated synthesis. The method comprises the steps of: (a) providing a mixture 
comprising (i) a plurality of different first reactive units each linked to first oligonucleotides 
comprising a codon sequence, wherein each oligonucleotide sequence is also indicative of the 
first reactive unit attached thereto; (b) providing a second reactive unit attached to a second 
oligonucleotide comprising an anti-codon sequence complementary to the codon sequence of at 
least one first reactive unit, wherein the anti-codon sequence is indicative of the second reactive 
unit; (c) annealing the codon sequence of at least one of the first oligonucleotides with the anti- 
codon sequence of the second oligonucleotide to induce a reaction between the first and second 
reactive units to form a first reaction intermediate linked at least to a first oligonucleotide; and 
(d) combining the first reaction intermediate with a free reactant selectively reactive with the 
first reaction intermediate, thereby synthesizing a first reaction product linked to the identifying 
sequence, wherein the free reactant is more reactive with the first reaction intermediate than with 
at least one of, and optionally all of, the reactive units or other reactive intermediates in the 
mixture. 

[0050] This approach is shown schematically in FIG. 4 and is similar to the approach shown 
schematically in FIG. 2 except the first reactive unit is present as a mixture of first reactive units. 
Briefly, the initial reaction contains a plurality of templates, where each template contains a first 
reactive unit (denoted as FRUi, FRU 2 , FRU 3 and FRU 4 ) attached, preferably, covalently 
attached, to its own respective oligonucleotide containing its own codon unique sequence (CS). 
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The oligonucleotide preferably also contains an identifier sequence (IS) that identifies what first 
reactive unit is attached to what codon sequence of the template. A transfer unit containing a 
second reactive unit (SRU) attached, preferably, covalently attached, to an oligonucleotide 
containing an anti-codon sequence complementary to the codon sequence is combined with the 
templates. The ACS of the transfer unit anneals to the CS of the template to bring the FRUi and 
SRU into reactive proximity, whereupon the FRUi and SRU react with one another to produce 
reaction intermediate (RIj) that still remains attached to the oligonucleotide containing the CS. 
When combined with the free reactant (FR), the RI is chemically transferred by the FR to 
produce the reaction product (RP) that is still linked to the CS. Assuming that the 
oligonucleotide attached to the RP contains a CS 9 then it is possible to determine what SRU was 
involved in the synthesis of RI and/or the RP. Similarly, if the oligonucleotide attached to RP 
contains the IS, then it is possible to determine what FRU was involved in the synthesis of the RI 
and/or the RP. Accordingly, this information can be used to determine the identity and synthetic 
history of RI and/or RP. As discussed, it is contemplated that the first reactive units can be small 
molecule scaffolds useful in the design and synthesis of a small molecule library. 

[0051] In addition, it is possible that multiple different functional group transformations can 
occur simultaneously in the same reaction vessel. Accordingly, the method can also include the 
additional steps of: providing a third different reactive unit linked to a third oligonucleotide 
comprising an anti-codon sequence complementary to the codon sequence of at least one 
different first reactive unit, wherein the anti-codon sequence is indicative of the third reactive 
unit; annealing the codon sequence of a different one of the first oligonucleotides with the anti- 
codon sequence of the third oligonucleotide to induce a reaction between the first and third 
reactive units to form a second reaction intermediate attached at least to a first, different 
oligonucleotide; and combining the second reaction intermediate with a free reactant selectively 
reactive with the second reaction intermediate, thereby synthesizing a second reaction product 
attached to the identifying sequence, wherein the free reactant is more reactive with the second 
reaction intermediate than with at least one of, and optionally all of, the reactive units or other 
reactive intermediates in the mixture. 

[0052] In addition, it is contemplated that the reaction products can be exposed to other 
rounds of functional group transformations, for example, where FRUi is a small molecule 
scaffold, to produce further modified products. 
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[0053] As will be appreciated by those skilled in the art, the method of the invention can be 
used to expand the range of chemistries that can be used during nucleic acid-templated chemical 
syntheses. General considerations concerning the selection and use of templates, transfer units, 
reaction conditions, reaction chemistries, selection procedures are know in the art. A general 
5 discussion of these considerations follows. 

I. TEMPLATE CONSIDERATIONS 

[0054] The nucleic acid template can direct a wide variety of chemical reactions without 
obvious structural requirements by sequence-specifically recruiting reactants linked to 
complementary oligonucleotides. During synthesis, the template hybridizes or anneals to one or 
10 more transfer units to direct the synthesis of a reaction intermediate that can subsequently be 
converted by a free reactant into a reaction product. The reaction product then is selected or 
screened based on certain criteria, such as the ability to bind to a preselected target molecule. 
Once the reaction product has been identified, the associated template can then be sequenced to 
decode the synthetic history of the reaction intermediate and/or the reaction product. 

1 5 (i) Template Format 

[0055] The length of the template may vary greatly depending upon the type of the nucleic 
acid-templated synthesis contemplated. For example, in certain embodiments, the template may 
be from 10 to 10,000 nucleotides in length, from 20 to 1,000 nucleotides in length, from 20 to 
400 nucleotides in length, from 40 to 1,000 nucleotides in length, or from 40 to 400 nucleotides 
20 in length. The length of the template will of course depend on, for example, the length of the 

codons, the complexity of the library, the complexity and/or size of a reaction product, the use of 
spacer sequences, etc. 

[0056] The template may incorporate a hairpin loop on one end terminating in a reactive unit 
that can interact with one or more reactive units associated with transfer units. For example, a 
25 DNA template can comprise a hairpin loop terminating in a 5 5 -amind group, which may or may 
not be protected. The amino group may act as an initiation point for formation of an unnatural 
polymer or small molecule. 
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(ii) Codon Usage 

[0057] It is contemplated that the sequence of the template may be designed in a number of 
ways. For example, the length of the codon must be determined and the codon sequences must 
be set. If a codon length of two is used, then using the four naturally occurring bases only 16 
5 possible combinations are available to be used in encoding the library. If the length of the codon 
is increased to three (the number Nature uses in encoding proteins), the number of possible 
combinations increases to 64. If the length of the codon is increased to four, the number of 
possible combinations increases to 256. Other factors to be considered in determining the length 
of the codon are mismatching, frame-shifting, complexity of library, etc. As the length of the 
10 codon is increased up to a certain point the number of mismatches is decreased; however, 
excessively long codons likely will hybridize despite mismatched base pairs. 

[0058] Although the length of the codons may vary, the codons may range from 2 to 50 
nucleotides, from 2 to 40 nucleotides, from 2 to 30 nucleotides, from 2 to 20 nucleotides, from 2 
to 15 nucleotides, from 2 to 10 nucleotides, from 3 to 50 nucleotides, from 3 to 40 nucleotides, 

15 from 3 to 30 nucleotides, from 3 to 20 nucleotides, from 3 to 15 nucleotides, from 3 to 10 

nucleotides, from 4 to 50 nucleotides, from 4 to 40 nucleotides, from 4 to 30 nucleotides, from 4 
to 20 nucleotides, from 4 to 15 nucleotides, from 4 to 10 nucleotides, from 5 to 50 nucleotides, 
from 5 to 40 nucleotides, from 5 to 30 nucleotides, from 5 to 20 nucleotides, from 5 to 15 
nucleotides, from 5 to 10 nucleotides, from 6 to 50 nucleotides, from 6 to 40 nucleotides, from 6 

20 to 30 nucleotides, from 6 to 20 nucleotides, from 6 to 15 nucleotides, from 6 to 10 nucleotides, 
from 7 to 50 nucleotides, from 7 to 40 nucleotides, from 7 to 30 nucleotides, from 7 to 20 
nucleotides, from 7 to 15 nucleotides, from 7 to 10 nucleotides, from 8 to 50 nucleotides, from 8 
to 40 nucleotides, from 8 to 30 nucleotides, from 8 to 20 nucleotides, from 8 to 15 nucleotides, 
from 8 to 10 nucleotides, from 9 to 50 nucleotides, from 9 to 40 nucleotides, from 9 to 30 
25 nucleotides, from 9 to 20 nucleotides, from 9 to 15 nucleotides, from 9 to 10 nucleotides. 
Codons, however, preferably are 3, 4, 5, 6, 7, 8, 9 or 10 nucleotides in length. 

[0059] A set of codons used in the template preferably maximizes the number of mismatches 
between any two codons within a codon set to ensure that only the proper anti-codons of the 
transfer units anneal to the codon sites of the template. Furthermore, it is important that the 
30 template has mismatches between all the members of one codon set and all the codons of a 

different codon set to ensure that the anti-codons do not inadvertently bind to the wrong codon 
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set. The choice of exemplary codon sets and methods of creating functional codon sets are 
described, for example, in U.S. Patent Publication No. US 2004/0180412. Using this and other 
approaches, different sets of codons can be generated so that no codons are repeated. 

[0060] When the nucleic acid-templated synthesis is used to produce a polymer or a small 
5 molecule, spacer sequences may also be placed between the codons to prevent frame shifting. 
For example, the bases of the template that encode a polymer subunit (the "genetic code" for the 
polymer) may be chosen so as to minimize the possibility of out-of-frame annealing. These 
genetic codes reduce undesired frameshifted nucleic acid-templated polymer translation and 
differ in the range of expected melting temperatures and in the minimum number of mismatches 
1 0 that result during out-of-frame annealing. 

(Hi) Template Synthesis 

[0061] The templates may be synthesized using methodologies well known in the art. These 
methods include both in vivo and in vitro methods including PCR, plasmid preparation, 
endonuclease digestion, solid phase synthesis (for example, using an automated synthesizer), in 
15 vitro transcription, strand separation, etc. Following synthesis, the template, when desired may 
be attached (for example, covalently or non covalently attached) with a reactive unit of interest 
using standard coupling chemistries known in the art. 

[0062] An efficient method to synthesize a large variety of templates is to use a "split-pool" 
technique. The oligonucleotides are synthesized using standard 3 5 to 5' chemistries. First, the 

20 constant 3 5 end is synthesized. This is then split into n different vessels, where n is the number 
of different codons to appear at that position in the template. For each vessel, one of the n 
different codons is synthesized on the (growing) 5' end of the constant 3' end. Thus, each vessel 
contains, from 5' to 3', a different codon attached to a constant 3' end. The n vessels then are 
pooled, so that a single vessel contains n different codons attached to the constant 3 5 end. Any 

25 constant bases adjacent the 5' end of the codon are now synthesized. The pool then is split into 
m different vessels, where m is the number of different codons to appear at the next (more 5') 
position of the template. A different codon is synthesized (at the 5' end of the growing 
oligonucleotide) in each of the m vessels. The resulting oligonucleotides are pooled in a single 
vessel. Splitting, synthesizing, and pooling are repeated as required to synthesize all codons and 

30 constant regions in the oligonucleotides. 
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II. TRANSFER UNITS 

[0063] A transfer unit comprises an oligonucleotide containing an anti-codon sequence and a 
reactive unit. The anti-codons are designed to be complementary to the codons present in the 
template. Accordingly, the sequences used in the template and the codon lengths should be 
considered when designing the anti-codons. Any molecule complementary to a codon used in 
the template may be used, including natural or non-natural nucleotides. In certain embodiments, 
the codons include one or more bases found in nature (i.e., thymidine, uracil, guanidine, 
cytosine, and adenine). Thus, the anti-codon can include one or more nucleotides normally 
found in Nature with a base, a sugar, and an optional phosphate group. 

[0064] As discussed above, the anti-codon is associated with a particular type of reactive unit 
to form a transfer unit. The reactive unit may represent a distinct entity or may be part of the 
functionality of the anti-codon unit. In certain other embodiments, where a small molecule 
library is to be created rather than a polymer library, the anti-codon generally is associated with a 
reactive unit or reactant used to modify a small molecule scaffold. In certain embodiments, the 
reactant is linked to the anti-codon via a linker long enough to allow the reactant to come into 
reactive proximity with the small molecule scaffold. The linker preferably has a length and 
composition to permit intramolecular reactions but yet minimize intermolecular reactions. The 
reactants include a variety of reagents as demonstrated by the wide range of reactions that can be 
utilized in nucleic acid-templated synthesis and can be any chemical group, catalyst (e.g., 
organometallic compounds), or reactive moiety (e.g., electrophiles, nucleophiles) known in the 
chemical arts. 

[0065] In certain embodiments, each anti-codon sequence is associated with one monomer 
type. For example, the anti-codon sequence ATTAG may be associated with a carbamate residue 
with an isobutyl side chain, and the anti-codon sequence CATAG may be associated with a 
carbamate residue with a phenyl side chain. This one-for-one mapping of anti-codon to 
monomer units allows the decoding of any polymer of the library by sequencing the nucleic acid 
template used in the synthesis and allows synthesis of the same polymer or a related polymer by 
knowing the sequence of the original polymer. By changing (e.g., mutating) the sequence of the 
template, different monomer units may be introduced, thereby allowing the synthesis of related 
polymers, which can subsequently be selected and evolved. In certain preferred embodiments, 
several anti-codons may code for one monomer unit as is the case in Nature. 
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[0066] The anti-codon can be associated with the reactant through a linker moiety. The 
linkage can be cleavable by light, oxidation, hydrolysis, exposure to acid, exposure to base, 
reduction, etc. Fruchtel et al (1996) Angew. Chem. Int. Ed. Engl. 35: 17 describes a variety of 
linkages useful in the practice of the invention. The linker facilitates contact of the reactant with 
the small molecule scaffold and in certain embodiments, depending on the desired reaction, 
positions DNA as a leaving group ("autocleavable" strategy), or may link reactive groups to the 
template via the "scarless" linker strategy (which yields product without leaving behind an 
additional atom or atoms having chemical functionality), or a "useful scar" strategy (in which a 
portion of the linker is left behind to be functionalized in subsequent steps following linker 
cleavage). Useful linkers, their design and use are described in U.S. Patent Application 
Publication No. US 2004/0180412. 

[0067] The specific annealing of transfer units to templates permits the use of transfer units 
at concentrations lower than concentrations used in many traditional organic syntheses. Thus, 
transfer units can be used at submillimolar concentrations (e.g. less than 100 jxM, less than 10 
|uM, less than 1 |uM, less than 100 nM, or less than 10 nM). 

III. CHEMICAL REACTIONS 

[0068] A variety of compounds and/or libraries can be prepared using the methods described 
herein. In certain embodiments, compounds that are not, or do not resemble, nucleic acids or 
analogs thereof, are synthesized according to the method of the invention. In certain other 
embodiments, compounds that are not, or do not resemble, proteins, peptides, or analogs thereof, 
are synthesized according to the method of the invention. 

(i) Coupling Reactions for Small Molecule Synthesis 

[0069] In some embodiments, it is possible to create compounds such as small molecules 
using the methods described herein. These small molecules may be like natural products, non- 
polymeric, and/or non-oligomeric. The substantial interest in small molecules is due in part to 
their use as the active ingredient in many pharmaceutical preparations although they may also be 
used, for example, as catalysts, materials, or additives. 

[0070] In synthesizing small molecules using the method of the present invention, an 
evolvable template can be used. The template can include a small molecule scaffold upon which 
the small molecule is to be built, or a small molecule scaffold may be added to the template. The 



WO 2006/079061 



PCT/US2006/002420 



-20- 

small molecule scaffold can be any chemical compound with two or more sites for 
functionalization. For example, the small molecule scaffold can include a ring system (e.g., the 
ABCD steroid ring system found in cholesterol) with functionalizable groups coupled to the 
atoms making up the rings. In another example, the small molecule may be the underlying 
5 structure of a pharmaceutical agent such as morphine, epothilone or a cephalosporin antibiotic. 
The sites or groups to be functionalized on the small molecule scaffold may be protected using 
methods and protecting groups known in the art. The protecting groups used in a small molecule 
scaffold may be orthogonal to one another so that protecting groups can be removed one at a 
time. 

1 0 [0071] In this approach, the transfer units comprise an anti-codon associated with a reactant 
or a building block for use in modifying, adding to, or taking away from the small molecule 
scaffold. The reactants or building blocks may be, for example, electrophiles (e.g., acetyl, 
amides, acid chlorides, esters, nitriles, imines), nucleophiles (e.g., amines, hydroxyl groups, 
thiols), catalysts (e.g., organometallic catalysts), or side chains. The transfer units are allowed to 

15 contact the template under hydridizing conditions. As a result of oligonucleotide annealing, the 
attached reactant or building block is allowed to react with a site on the small molecule scaffold 
to produce one or more reaction intermediates. The reaction intermediates can then be reacted 
with a free reactant to produce a reaction product. 

[0072] The reaction conditions, linker, reactant, and site to be functionalized are chosen to 
20 avoid intermolecular reactions and accelerate intramolecular reactions. Sequential or 

simultaneous contacting of the template with transfer units can be employed depending on the 
particular compound to be synthesized. 

[0073] After the sites on the scaffold have been modified, the newly synthesized small 
molecule remains associated with the template that encoded its synthesis. Decoding the 
25 sequence of the template permits the decon volution of the synthetic history and thereby the 

structure of the small molecule. The template can also be amplified in order to create more of 
the desired small molecule and/or the template can be evolved (mutagenized) to create related 
small molecules. The small molecule can also be cleaved from the template for purification or 
screening. 
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(it) Coupling Reactions for Polymer Synthesis 

[0074] In certain embodiments, polymers, specifically unnatural polymers, can be prepared 
using the techniques described herein. Exemplary unnatural polymers include, but are not 
limited to, peptide nucleic acid (PNA) polymers, polycarbamates, polyureas, polyesters, 
5 polyacrylate, polyalkylene (e.g., polyethylene, polypropylene), polycarbonates, polypeptides 
with unnatural stereochemistry, polypeptides with unnatural amino acids, and combination 
thereof. In certain embodiments, the polymers comprise at least 10, 25, 75, 100, 125, 150 
monomer units or more. The polymers synthesized using the methodologies described herein 
may be used, for example, as catalysts, pharmaceuticals, metal chelators, or catalysts. 

10 [0075] In preparing certain unnatural polymers, the monomer units attached to the anti- 

codons may be any monomers or oligomers capable of being joined together to form a polymer. 
The monomer units may be, for example, carbamates, D-amino acids, unnatural amino acids, 
PNAs, ureas, hydroxy acids, esters, carbonates, acrylates, or ethers. 

(Hi) Reaction Conditions 

15 [0076] It is understood that nucleic acid-templated reactions, for example, nucleic acid- 

templated reactions to produce reaction intermediates, can occur in aqueous or non-aqueous (i.e., 
organic) solutions, or a mixture of one or more aqueous and non-aqueous solutions. In aqueous 
solutions, reactions can be performed at pH ranges from about 2 to about 12, or preferably from 
about 2 to about 10, or more preferably from about 4 to about 10. The reactions used in DNA- 

20 templated chemistry preferably should not require very basic conditions (e.g., pH > 12, pH > 10) 
or very acidic conditions (e.g., pH < 1, pH < 2, pH < 4), because extreme conditions may lead to 
degradation or modification of the nucleic acid template and/or molecule (for example, the 
polymer, or small molecule) being synthesized. The aqueous solution can contain one or more 
inorganic salts, including, but not limited to, NaCl, Na 2 S0 4 , KC1, Mg +2 , Mn +2 , etc., at various 

25 concentrations. 

[0077] Organic solvents suitable for nucleic acid-templated reactions include, but are not 
limited to, methylene chloride, chloroform, dimethylformamide, and organic alcohols, including 
methanol and ethanol. To permit quantitative dissolution of reaction components in organic 
solvents, quaternized ammonium salts, such as, for example, long chain tetraalkylammonium 
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salts, can be added (Jost et al (1989) NUCLEIC ACIDS Res. 17: 2143; Mel'nikov et al (1999) 
Langmuir 15: 1923-1928). 

[0078] Nucleic acid-templated reactions may require a catalyst, such as, for example, 
homogeneous, heterogeneous, phase transfer, and asymmetric catalysis. In other embodiments, a 
catalyst is not required. The presence of additional, accessory reagents not linked to a nucleic 
acid are preferred in some embodiments. Useful accessory reagents can include, for example, 
oxidizing agents (e.g., NaI0 4 ); reducing agents (e.g., NaCNBH 3 ); activating reagents (e.g., EDC, 
NHS, and sulfo-NHS); transition metals such as nickel (e.g., Ni(N0 3 ) 2 ), rhodium (e.g. RhCI 3 ), 
ruthenium (e.g. RuCl 3 ), copper (e.g. Cu(N0 3 ) 2 ), cobalt (e.g. CoCl 2 ), iron (e.g. Fe(N0 3 ) 3 ), 
osmium (e.g. Os0 4 ), titanium (e.g. TiCl 4 or titanium tetraisopropoxide), palladium (e.g. 
NaPdCl 4 ), or Ln; transition metal ligands (e.g., phosphines, amines, and halides); Lewis acids; 
and Lewis bases. 

[0079] Reaction conditions preferably are optimized to suit the nature of the reactive units 
and oligonucleotides used. It is understood that the choice of reagents, for example, free 
reactants, and the reaction conditions used to create the reaction intermediates and to convert the 
reaction intermediates into final products will depend upon the particular compounds and 
libraries to be produced. It is contemplated, however, that the choice of reagents and reaction 
conditions is within the level of skill in the art. 

(iv) Classes of Chemical Reactions 

[0080] It is understood that a large variety of chemical reactions can be used to create the 
reaction intermediates and/or to create the reaction products from the reaction intermediates. 
Known chemical reactions for synthesizing polymers, small molecules, or other molecules can 
be used in nucleic acid-templated reactions. Thus, reactions such as those listed in March 's 
Advanced Organic Chemistry, Organic Reactions, Organic Syntheses, organic text books, 
journals such as Journal of the American Chemical Society, Journal of Organic Chemistry, 
Tetrahedron, etc., and Carruther's Some Modern Methods of Organic Chemistiy can be used. 
The chosen reactions preferably are compatible with nucleic acids such as DNA or RNA or are 
compatible with the modified nucleic acids used as the template. 

[0081] Notwithstanding the foregoing, it is contemplated that the invention is particularly 
useful in performing certain functional group transformations, which include, without limitation, 



WO 2006/079061 



PCT/US2006/002420 



-23 - 

azide-to-amine transformations, azide-to-thiol transformations, azide-to-carboxylic acid 
transformations, hydroxyl-to-amirie transformations, hydroxyl-to-thiol transformations, acetal- 
to-aldehyde transformations, ketal-to-ketone transformations, carbonate-to-hydroxyl group 
transformations, carbamate-to-amine transformations, thiocarbonate-to-thiol transformations, 
5 nitro group-to-amine transformations, sulfonamide-to-amine transformations, alkene-to-epoxide 
transformations, a,(3-unsaturated ketone-to-epoxide transformations, epoxide-to-l,2-diol 
transformations, epoxide-to- 1 ,2-hydroxy amine transformations, epoxide-to-l,2-hydroxy sulfide 
transformations, alkene-to-aziridine transformations, aziridine-to-l,2-diamine transformations, 
aziridine-to-l,2-amino sulfide transformations, phosphonate ester-to-phosphonic acid 
10 transformations, imide-to-amine transformations, and nitrile-to-carboxylic acid transformations. 
Other exemplary transformations may be found, for example, in Greene et al (ed.) (1999) 
Protective Groups in Organic Synthesis 3rd ed., Wiley-Interscience, and in Kocienski 
(1994) Protecting Groups, Thieme. 

IV. SELECTION, SCREENING AND IDENTIFICATION OF PRODUCTS 

1 5 (i) Selection and Screening Approaches 

[0082] Selection and/or screening for reaction products with desired activities (such as 
catalytic activity, binding affinity, or a particular effect in an activity assay) may be performed 
using methodologies known and used in the art. For example, affinity selections may be 
performed according to the principles used in library-based selection methods such as phage 

20 display, polysome display, and mRNA-fusion protein displayed peptides. Selection for catalytic 
activity may be performed by affinity selections on transition-state analog affinity columns 
(Baca et al (1997) Proc. Natl. Acad. Sci. USA 94(19): 10063-8) or by function-based 
selection schemes (Pedersen et al (1998) Proc. Natl. Acad. Sci. USA 95(18): 10523-8). Since 
minute quantities of DNA (~10~ 20 mol) can be amplified by PGR (Kramer et al (1999) CURRENT 

25 Protocols in Molecular Biology (ed. Ausubel, F. M.) 15.1-15.3, Wiley), these selections 
can be conducted on a scale ten or more orders of magnitude less than that required for reaction 
analysis by current methods, making a truly broad search both economical and efficient. 

[0083] The templates and reaction products can be selected (or screened) for binding to a 
target molecule. In this context, selection or partitioning means any process whereby a library 
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member bound to a target molecule is separated from library members not bound to target 
molecules. Selection can be accomplished by various methods known in the art. 

[0084] The templates of the present invention contain a built-in function for direct selection 
and amplification. In most applications, binding to a target molecule preferably is selective, such 
5 that the template and the resulting reaction product bind preferentially with a specific target 
molecule, perhaps preventing or inducing a specific biological effect. Ultimately, a binding 
molecule identified using the present invention may be useful as a therapeutic and/or diagnostic 
agent. Once the selection is complete, the selected templates optionally can be amplified and 
sequenced. The selected reaction products, if present in sufficient quantity, can be separated 
10 from the templates, purified (e.g., by HPLC, column chromatography, or other chromatographic 
method), and further characterized. 

[0085] The selection strategy can be carried out to allow selection against almost any target. 
Importantly, the selection strategy does not require any detailed structural information about the 
target molecule or about the molecules in the libraries. The entire process is driven by the 
15 binding affinity involved in the specific recognition and binding of the molecules in the library to 
a given target. 

[0086] The linkage between the template molecule and reaction product allows rapid 
identification of binding molecules using various selection strategies. Nucleic acid-templated 
syntheses broadly permit identifying binding molecules for any known target molecule. In 
20 addition, novel unknown targets can be discovered by isolating binding molecules against 

unknown antigens (epitopes) and using these binding molecules for identification and validation. 

[0087] Selection of binding molecules from a library can be performed in any format to 
identify optimal binding molecules. Binding selections typically involve immobilizing the 
desired target molecule, adding a library of potential binders, and removing non-binders by 

25 washing. When the molecules showing low affinity for an immobilized target are washed away, 
the molecules with a stronger affinity generally remain attached to the target. The enriched 
population remaining bound to the target after stringent washing is preferably eluted with, for 
example, acid, chaotropic salts, heat, competitive elution with a known ligand or by proteolytic 
release of the target and/or of template molecules. The eluted templates are suitable for PGR, 

30 leading to many orders of amplification, whereby essentially each selected template becomes 
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available at a greatly increased copy number for cloning, sequencing, and/or further enrichment 
or diversification. 

[0088] The target molecule (peptide, protein, DNA or other antigen) can be immobilized on 
a solid support, for example, a container wall, a wall of a microtiter plate well. The library 
5 preferably is dissolved in aqueous binding buffer in one pot and equilibrated in the presence of 
immobilized target molecule. Non-binders are washed away with buffer. Those molecules that 
may be binding to the target molecule through their attached DNA templates rather than through 
their synthetic moieties can be eliminated by washing the bound library with unfunctionalized 
templates lacking PGR primer binding sites. Remaining bound library members then can be 
10 eluted, for example, by denaturation. 

[0089] Alternatively, the target molecule can be immobilized on beads, particularly if there 
is doubt that the target molecule will adsorb sufficiently to a container wall, as may be the case 
for an unfolded target eluted from an SDS-PAGE gel. The derivatized beads can then be used to 
separate high-affinity library members from nonbinders by simply sedimenting the beads in a 
15 benchtop centrifuge. Alternatively, the beads can be used to make an affinity column. In such 
cases, the library is passed through the column one or more times to permit binding. The column 
then is washed to remove nonbinding library members. Magnetic beads are essentially a variant 
on the above; the target is attached to magnetic beads which are then used in the selection. 

[0090] Library members that bind a target molecule can be released by denaturation, acid, or 
20 chaotropic salts. Alternatively, elution conditions can be more specific to reduce background or 
to select for a desired specificity. Elution can be accomplished using proteolysis to cleave a 
linker between the target molecule and the immobilizing surface or between the reaction product 
and the template. Also, elution can be accomplished by competition with a known competitive 
ligand for the target molecule. Alternatively, a PGR reaction can be performed directly in the 
25 presence of the washed target molecules at the end of the selection procedure. Thus, the binding 
molecules need not be elutable from the target to be selectable since only the template is needed 
for further amplification or cloning, not the reaction product itself. Indeed, some target 
molecules bind the most avid ligands so tightly that elution would be difficult. 



WO 2006/079061 



PCT/US2006/002420 



-26- 

(ii) Identification of Products 

[0091] Once all rounds of selection are complete, the templates which are, or formerly were, 
attached to the selected reaction product preferably are amplified using any suitable technique to 
facilitate sequencing or other subsequent manipulation of the templates. Natural 
oligonucleotides can be amplified by any state of the art method. These methods include, for 
example, polymerase chain reaction (PCR); nucleic acid sequence-based amplification (see, for 
example, Compton (1991) NATURE 350: 91-92), amplified anti-sense RNA (see, for example, 
van Gelder et al (1988) Proc. Natl. Acad. Sci. USA 85: 77652-77656); self-sustained 
sequence replication systems (Gnatelli et al (1990) Proc. Natl. Acad. Sci. USA 87: 1874- 
1878); polymerase-independent amplification (see, for example, Schmidt et al (1997) Nucleic 
Acids Res. 25: 4797-4802, and in vivo amplification of plasmids carrying cloned DNA 
fragments. Descriptions of PCR methods are found, for example, in Saiki et al (1985) Science 
230: 1350-1354; Scharfefa/. (1986) Science 233: 1076-1078; and in U.S. Patent No. 4,683,202. 
Ligase-mediated amplification methods such as Ligase Chain Reaction (LCR) may also be used. 
In general, any means allowing faithful, efficient amplification of selected nucleic acid 
sequences can be employed in the method of the present invention. It is preferable, although not 
necessary, that the proportionate representations of the sequences after amplification reflect the 
relative proportions of sequences in the mixture before amplification. 

[0092] For non-natural nucleotides the choices of efficient amplification procedures are 
fewer. As non-natural nucleotides can be incorporated by certain enzymes including 
polymerases it will be possible to perform manual polymerase chain reaction by adding the 
polymerase during each extension cycle. 

[0093] Once amplified, the sequences of the template that encoded a product of interest can 
be determined. Sequencing, for example, can be performed by a standard dideoxy chain 
termination method, or by chemical sequencing, for example, using the Maxam-Gilbert 
sequencing procedure. Alternatively, the sequence of the template (or, if a long template is used, 
the variable portion(s) thereof) can be determined by hybridization experiments. For example, a 
single-stranded template molecule associated with a detectable moiety such as a fluorescent 
moiety is exposed to a chip bearing a large number of clonal populations of single-stranded 
nucleic acids or nucleic acid analogs of known sequence, each clonal population being present at 
a particular addressable location on the chip. The template sequences are permitted to anneal to 



WO 2006/079061 



PCT/US2006/002420 



-27- 

the chip sequences. The position of the detectable moieties on the chip then is determined. 
Based upon the location of the detectable moiety and the immobilized sequence at that location, 
the sequence of the template can be determined. It is contemplated that large numbers of such 
oligonucleotides can be immobilized in an array on a chip or other solid support. 

5 [0094] Libraries can be evolved by introducing mutations at the DNA level, for example, 
using error-prone PGR (Cadwell et al (1992) PCR Methods Appl. 2: 28) or by subjecting the 
DNA to in vitro homologous recombination (Stemmer (1994) Proc. Natl. Acad. Scl USA 91 : 
10747; Stemmer (1994) Nature 370: 389) or by cassette mutagenesis. Template evolution and 
evolutionary synthesis are described, for example, in U.S. Patent Application, Publication No. 
10 US 2004/0180412. 

[0095] The following examples contain important additional information, exemplification 
and guidance that can be adapted to the practice of this invention in its various embodiments and 
equivalents thereof. Practice of the invention will be more fully understood from these following 
examples, which are presented herein for illustrative purpose only, and should not be construed 
15 as limiting in anyway. 

EXAMPLES 

[0096] The following examples demonstrate the feasibility of sequence programmed 
functional group transformations. Examples 1 and 2 describe three sequence-programmed 
functional group transformations, namely-azide to-amine, azide-to-thiol, and azide-to-carboxylic 

20 acid transformations where the end products of the transformations have been characterized by 
gel electrophoresis (Example 1) or by mass spectrometry (Example 2). Example 3 shows that it 
is possible to transform amine-linked templates into a sulfonamide, a carbamate, a urea or a 
thiourea using small molecule reagents, for example, sulfonyl chloride, chloroformate, 
isocyanate, and isothiocyanate reactants not linked to DNA. Example 4 shows that it is possible 

25 to sequence-specifically transform, in a single-solution, a mixture of organic azides into amine 
intermediates and then sequence-specifically transform the amine intermediates into 
sulfonamide, carbamate, urea, and thiourea products using free reactants (e.g., sulfonyl chloride, 
chloroformate, isocyanate, and isothiocyanate) not linked to DNA. 
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Example 1. DNA-Templated Transformation of Azides into Primary Amines, Carboxylic 
Acids, and Thiols (Characterization by PAGE) 

[0097] This example describes sequence-programmed functional group transformations 
where an azide can be specifically converted into an amine, a thiol, or a carboxylic acid. The 
5 individual reaction schemes and the resulting reaction yields are shown in FIGS. 5A and 5B. 

I. Materials and Methods 

(i) Synthesis of Azido Acids 

[0098] Azido substrates for the synthesis of compounds 1-12 shown in FIG. 5B were 
prepared from the corresponding carboxylic acid precursors as follows: 

10 [0099] Azido Acetic Acid (Used to produce template 1 in FIG. 5B). This reagent was 

produced as described in Lundquist et al. (2001) Org Lett. 3: 781. The product was found to 
have the following characteristics: *H NMR (300 MHz, CDC1 3 ) 8 3.96 (2H, s). 

[00100] Azido-3 -Methyl Pentanoic Acid (Used to produce template 2 in FIG. 5B). This 
reagent was produced as described in Lundquist et al. (2001) supra. The product was found to 
15 have the following characteristics: *H NMR (300 MHz, CDC1 3 ) 8 3.92(1H, br), 3.88 (2H, dd, J 
= 5.7 Hz, J= 9 Hz), 1.77 (2H 5 m) 5 0.99 (6H, t, J= 6.6 Hz). 

[00101] 4-Azidomethvlbenzoic Acid (Used to prepare template 3 in FIG, 5B). Sodium azide 
(1.3 g, 20 mmol) and 18-crown-6 ether (0.2 mL, 1 mmol) were dissolved in DMSO (4 mL). To 
the resulting solution was added 4-chloromethyl benzoic acid (1.71g, lOmmol) and the reaction 
20 mixture was stirred 12 h at 25 °C. The reaction was diluted in EtOAc, washed with 0.1 N HC1 
(2x), then washed with brine. The organic layer was dried with Na 2 S0 4 and concentrated to 
provide a white solid (1.75 g, quant.). The resulting product was found to have the following 
characteristics: *H NMR (300 MHz, CDC1 3 ) 5 8.15 (2H, d, J= 8.4 Hz) 7.45 (2H, d, J= 8.4 Hz) 
(1H, s); ESMS calculated for C 8 H 6 N30 2 : 176.0460; observed: 176.0461. 

25 [00102] 1-Azidocvclohexvl Carboxylic Acid (Used to prepare template 5 in FIG. 5B) and 
Azidoisoglutamic Acid (Used to prepare template 4 in FIG. 5B) were synthesized according to 
the method described for the synthesis of azido acids by diazo transfer in Lundquist et al. (2001) 
supra. 1-azidocyclohexyl carboxylic acid was found to have the following characteristics: l U 
NMR (300 MHz, CDC1 3 ) 5 1.86 (4H, m) 1.63 (4H, m) 1.36 (2H, m); CIMS calculated for 
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C7H7N3O2 (M+NH4 + ): 1187.1195; observed: 1187.1188. Azidoisoglutamic acid was found to 
have the following characteristics: 'H NMR (300 MHz, CDC1 3 ) 8 6.43 (2H, d, J= 17.4 Hz) 3.14 
(1H, dd, J= 14.1 Hz, J= 6.9 Hz) 2.54 (2H, t J= 7.5 Hz) 2.23 (2H, dd, J= 13.6 HzJ= 6.6 Hz); 
CIMS calculated for C7H7N3O2 (M+NH 4 + ): 190.0931; observed: 190.040. 

[00103] 1-Azido M ethyl Benzoic Acid (Used to produce template 6 in FIG. 5B). This 
reagent was produced as described in Wada et al. (2001) Tetrahedron Lett. 42: 1069-72, and 
also in Love et al. (2001). J. ORG Chem. 66: 68165-76. The product was found to have the 
following characteristics: 'H NMR (300 MHz, CDC1 3 ) 5 8.18 (1H, dd, J= 7.6 Hz, J= 1 .2 Hz), 
7.63 (1H, td, J= 7.5 Hz, J= 1.2 Hz), 7.55 (1H, d, J= 6.6 Hz), 7.46 (1H, td, J= 7.5 Hz, /= 1.5 
Hz), 4.894 (2H, s). 

[00104] 4-Azidobenzoic Acid (Used to produce template 7 in FIG. 5B). This reagent was 
purchased from Sigma-Aldrich (St. Louis, MO). 

[00105] 4-Azidobenzyl-Cvclohexvl Dicarboxylic Acid Monoester (Used to prepare template 8 
in FIG. 5B). 7rara-cyclohexyl dicarboxylic acid (200 mg, 1.16 mmol), EDC (223 mg, 2.32 
mmol), and N,N-diisopropylethylamine (0.4 mL, 2.32 mmol) were dissolved in CH 2 C1 2 (4 mL) 
and stirred for 30 min at 25°C. To this mixture was added 4-azido benzyl alcohol (86.6 mg, 0.58 
mmol). The reaction was stirred 12 h at 25°C. The reaction mixture was concentrated and 
purified by flash chromatography (30% EtOAc/hexanes). The desired ester was obtained as a 
yellow solid (18.2 mg, 5%). The resulting product was found to have the following 
characteristics: *H NMR (300 MHz, CDC1 3 ) 8 7.34 (2H, d, J= 8.4 Hz) 7.03 (2H, d, J= 8.4 Hz) 
5.08 (1H, s) 2.33 (2H, m) 2.09(4H, d, J= 9.3 Hz) 1.47 (4H, t, J= 9.9 Hz); ESMS calculated for 
C15H17N3O4 (M+HC0 2 '): 348.1196; observed: 348.1195. 

[00106] 4-Azidobenzvl-Succinic Acid Monoester (Used to prepare template 9 in FIG. 5B). 4- 
Azidobenzyl alcohol (100 mg, 0.67 mmol), succinic anhydride (134 mg, 1.37 mmol), and N,N- 
dimethylaminopyridine (3.7 mg, 30 u.mol) were dissolved in DMF (1 mL) and stirred 12 h at 
25 °C. The reaction mixture was concentrated and purified by flash chromatography (30% 
EtOAc/hexanes). The desired ester was obtained as yellow solid (75.9 mg, 45%). The resulting 
product was found to have the following characteristics: *H NMR (300 MHz, CDCI 3 ) 8 7.28 
(2H, d, J= 7.8 Hz) 6.96 (2H, d, J= 7.8 Hz), 5.05 (2H, s), 2.63 (4H, m); ESMS calculated for 
C7H6N3O2: 248.0672; observed: 248.0660. 
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[°°107] 4-Azidobenzvl-Diphenicacid Monoester (Used to prepare template 10 in FIG. 5B). 
4-Azidobenzyl alcohol (1 12 mg, 0.5 mmol) and diphenic acid anhydride (74.5 mg 5 0.5 mmol) 
were dissolved in pyridine (1 mL) and stirred 12 h at 25 °C. The reaction mixture was diluted in 
EtOAc, washed with phosphate buffer (pH 6.0, 2x), then washed with brine. The organic layer 
was dried over Na 2 S0 4 and concentrated in vacuo. The crude product was purified by flash 
chromatography (25% EtOAc/hexanes). The desired ester was obtained as yellow solid (193 
mg, 99%.). The resulting product was found to have the following characteristics: ] H NMR 
(300 MHz, CDC1 3 ) 5 9.87 (1H, s) 7.92 (1H, dd, J= 7.2 Hz, /= 1.2 Hz) 7.86 (1H, dd, J= 7.8 Hz, 
J= 1.2 Hz) 7.43 (2H, dd, J= 5.7 Hz J= 1.5 Hz) 7.38 (2H, dd, J= 5.7 Hz, J= 1.2 Hz) 7.32 (2H, 
dd, J- 7.5 Hz, J= 1.2 Hz) 7.28 (2H, dd, J= 7.3 Hz , J= 1.2 Hz) 6.98 (2H, J= 8.4 Hz) 6.85 (2H, 
J= 8.4 Hz) 4.91(2H, J= 2.7 Hz); ESMS calculated for C2iH 16 N 3 04 (M+H + ): 374.1 141; 
observed: 374.1149. 

[00108] 1-Azidomethvlbenzovl Thio Acetic Acid Thioester (Used to prepare template 11 in 
FIG. 5B). 2-Azidomethylbenzoyl acid (40 mg, 0.23 mmol) was mixed with EDC (64.9 mg, 0.34 
mmol) and N-hydroxysuccinimide (NHS) (39.1 mg, 0.34 mmol) in CH 2 C1 2 at 25°C for 2 h. The 
reaction mixture was washed with NaHC0 3 (2x), then washed with brine. The organic layer was 
concentrated and the crude product was directly used in the next step without further 
purification. AMiydroxylsuccinimidyl 2-azidomethyl benzoate ester (16.4 mg, 47 jumol) and 
thioacetic acid (3.2 |uL) in DMF (250 |uL) were allowed to react at 25 °C for 24 h. The reaction 
mixture was diluted in EtOAc and washed with NaHC0 3 (2x), then washed with brine. The 
organic layer was dried with Na 2 S0 4 and concentrated in vacuo. The crude mixture was purified 
by flash chromatography (30% EtOAc/hexanes) to provide the thioester (9.8 mg, 83%). The 
resulting product was found to have the following characteristics: ! H NMR (300 MHz, CDC1 3 ) 5 
7.95 (d, 1H, J= 7.8 Hz) 7.56 (t, 1H, J= 7.8 Hz) 7.50 (d, 1H, J= 6.6 Hz) 7.41 (t, 1H, ■/= 7.8 Hz) 
4.64 (s, 2H) 3.88 (s, 2H); ESMS calculated for C 7 H 6 N 3 02: 250.0287; observed: 250.0284. 

[00109] 2-Azidomethvlbenzovl Thio Propionic Acid Thioester (Used to prepare template 12 
in FIG, 5B). 2-azidomethylbenzoyl N-hydroxy succinimidyl (NHS) ester was prepared by 
mixing equal volumes of 1-azido methyl benzoic acid (used to produce template 6 in FIG. 5B) 
(900 mM in DMF), EDC (900 mM in DMF) and NHS (900 mM in DMF) at 25 °C for 1 h. The 
thiol group was attached to the DNA oligonucleotide in a parallel preparation, and was 
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incorporated into the template upon template formation (see below, preparation for 5' 2- 
azidomethylbenzoyl thio propionic acid thioester-linked DNA). 

(ii) Preparation of Functionalized Oligonucleotides 

[00110] Throughout this Example and the following Examples, oligonucleotides were 
5 synthesized on a Perseptive Biosystems Expedite 8090 DNA synthesizer using standard 

phosphoramidite protocols and purified using preparative scale reverse-phase HPLC. Reagents 
for automated solid-phase oligonucleotide synthesis were purchased from Glen Research. For 
amine-terminated and biotinylated DNA oligonucleotides described below, 5 5 amino-modifier 5 
(Glen Research) was used to prepare 5' amino-modified oligonucleotides; 3' amino-modifier C7 
10 CPG (Glen Research) was used to prepare 3' amino-modified oligonucleotides; and biotin TEG 
CPG (Glen Research) was used to prepare 3 5 biotin-labeled oligonucleotides. Functionalized 
DNA oligonucleotides were purified by analytical scale reverse-phase HPLC. 

[00111] Concentrations of purified oligonucleotides in solution were determined based on 
their absorbance at 260 nm measured on a Hewlett-Packard 8453 UV-visible spectrophotometer 
15 (Agilent Technologies). Oligonucleotides stained with ethidium bromide were visualized and 
quantitated by UV transillumination and densitometry using an Eagle Eye II densitometer 
(Stratagene). 

(a) Template Oligonucleotides 

[001 12] 5 9 Azide-Linked DNA Oligonucleotide Templates (Used to produce templates 1-11 
20 in FIG. 5B). The iV-hydroxy succinimidyl (NHS) ester of the desired azido acid was prepared 
by mixing equal volumes of the respective azido acid (900 mM in DMF), EDC (900 mM in 
DMF) and NHS (900 mM in DMF) at 25°C for 1 h. The crude NHS ester was added in two 
portions (50 |liL each) to a solution containing 5' amino-modified DNA oligonucleotide (50 juL 9 
typically 300 jliM) in 100 mM sodium phosphate buffer (pH 7.2, 350 juL). The 30-mer template 
25 used in these preparations was-5'NH 2 (C2H40)2-P0 3 H-GGT ACG AAT TGC ACT CGG GAA 
ATC CAC CTT (SEQ ID NO: 1). The coupling reaction was performed at 25 °C for 1 h. The 
resulting reaction mixture was directly loaded onto a NAP-5 size exclusion column (Amersham 
Biosciences) to remove organic solvent, salts, and excess small molecules, and the azide-linked 
DNA oligonucleotides were further purified by analytical scale reverse-phase HPLC (8-30% 
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MeCN/0.1 M TEAA gradient). The desired oligonucleotide products were characterized by 
MALDI-TOF mass spectrometry. 

[00113] 5' 2-Azidomethvlbenzovl Thio Propionic Acid Thioester-Linked DNA (Used to 
produce template 12 in FIG. 5B). A solution of 2, 2 ? -dithiodipropionic acid in DMF (900 mM) 
5 was mixed with equal volumes of EDC (900 mM in DMF) and NHS (900 mM in DMF) at 25°C 
for 1 h. The crude NHS ester (50 juL) was added to a solution containing 5' amino-modified 
DNA oligonucleotide (50 |aL, typically 300 juM) in 100 mM sodium phosphate buffer (pH 7.2, 
350 |uL). The coupling reaction was performed at 25 °C for 1 h. The reaction mixture was 
directly loaded onto a NAP-5 size exclusion column (Amersham Biosciences) and purified by 

10 analytical scale reverse-phase HPLC (8-30% MeCN/0.1 M TEAA gradient). The disulfide- 
linked oligonucleotide product was characterized by MALDI-TOF mass spectrometry. The 2- 
thiopropionic acid-linked oligonucleotide was prepared by treating the disulfide-linked 
oligonuleotide above (typically 10 pM) in 100 mM CAPS buffer (pH 8) with 20 mM DTT at 
25°C for 0.5 h. Excess DTT was removed by passing the reaction mixture through a gel 

15 filtration column. In parallel, 2-azidomethylbenzoyl //-hydroxy succinimidyl (NHS) ester was 
prepared as described (see above, preparation for 2- Azidomethyl benzoyl Thio Propionic Acid 
Thioester). The crude NHS ester (100 pL) was added to a solution of 5' thiol-linked 
oligonucleotide (100 |liL) in 100 mM sodium phosphate buffer (pH 7.2, 300 juL). The coupling 
reaction was performed at 25°C for 1 h. The reaction mixture was directly loaded onto a NAP-5 

20 size exclusion column and purified by analytical scale reverse-phase HPLC (8-30% MeCN/0. 1 
M TEAA gradient). The desired oligonucleotide product was characterized by MALDI-TOF 
mass spectrometry. 

(b) Transfer Units 

[00114] 3' Triphenyfohosphine-Linked DNA . Attachment of the triphenylphosphine group 
25 was performed on 3' amino-modified oligonucleotides linked to CPG resin. A 10-mer reagent 
fully complementary to the template had the structure 5' AAT TCG TAG C-OPO3H- 
CH2CH(CH 2 OH)(CH 2 )4NHCOC 6 H 4 PPh 2 (SEQ ID NO: 2). A 10-mer reagent containing a three- 
base mismatch relative to the template had the following structure - 5' AAT ACA TCC C- 
OP0 3 H-CH2CH(CH 2 OH)(CH 2 ) 4 NHCOC 6 H4PPh 2 (SEQ ID NO: 3). The latter was used as a 
30 control in these experiments. 
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[00115] The Fmoc group on 3 5 FMOC-NH-oligonucleotides was removed by three cycles of: 
(0 treatment with 20% piperidine in DMF for 10 min; (fl) washing with DMF; and (tit) washing 
with MeCN. The resin was dried under a stream of nitrogen gas. A solution of 4- 
diphenylphosphino benzoic acid (30.6 mg, 100 jLimoI), EDC (19.1 mg, 100 |Umol), N,N- 
5 diisopropylethylamine (36.8 jllL, 21 1 |umol) in DMF (0.6 mL) was added to the resin and the 
mixture was incubated at 37 °C for 2 h. The resin was washed with DMF (2x) and with MeCN 
(2x), then dried under nitrogen. The derivatized oligonucleotide was cleaved from the CPG resin 
by incubation in 1:1 ammonium hydroxidermethyl amine (AMA) with tris(2- 
carboxyethyl)phosphine hydrochloride (TCEP-HC1, 1 mg) at 55 °C for 45 min. The cleavage 
10 solution was filtered and purified by analytical scale reverse-phase HPLC (8-30% MeCN/0.1 M 
TEAA gradient). The desired oligonucleotide products were characterized by MALDI-TOF 
mass spectrometry. 

(c) Capture Reagents 

[00116] In order to perform polyacrylamide gel electrophoresis, the reaction products were 
1 5 captured using 20-mer secondary reagents (capture reagents) that annealed to the template. 

[00117] Amino-modified DNA (This reagent was used to capture the products of the 
templates 8-10). The 20-mer secondary reagent contained the sequence 5' TCC CGA GTG 
CAA TTC GTA CC-OP0 3 H-CH2CH(CH 2 OH)(CH 2 ) 4 NH 2 (SEQ ID NO: 4). This 
oligonucleotide was used as a starting material for the following capture reagents. 

20 [00118] V Bromoacetate-Linked DNA (This reagent was used to capture the products of the 
templates 11 and 12). The NHS ester of bromoacetic acid was prepared by mixing equal 
volumes of 900 mM bromoacetic acid in DMF, 900 mM EDC in DMF, and 900 mM NHS in 
DMF at 25 °C for 1 h. The crude NHS ester (100 j^L) was added to a solution of 5' amino- 
modified DNA oligonucleotide (50 |uL 5 typically 300 jaM) in 100 mM sodium phosphate buffer 

25 (pH 7.2, 350 juL). The coupling reaction was allowed to proceed at 25 C for 1 h. The reaction 
mixture was directly loaded onto a NAP-5 size exclusion column to remove organic solvent, 
salts, and excess small molecules and was further purified by analytical scale reverse-phase 
HPLC (8-30% MeCN/0.1 M TEAA gradient). The desired oligonucleotide products were 
characterized by MALDI-TOF mass spectrometry. 
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[00119] 3 9 4-Formvlbenzoate-Linked DNA (This reagent was used to capture the products of 
the template 7). The 4-formylbenzoate linked 20-mer DNA was prepared following the protocol 
for bromoacetate-1 inked DNA using 4-formyl benzoic acid instead of bromoacetic acid. 

[00120] 3* Succinic Acid Monoester-Linked DNA (This reagent was used to capture the 
products of the templates 1-6). Succinic anhydride (22 mg, 0.1 mmol) was activated with NHS 
(10 mg, 0.1 mmol) in DMF (200 |uL) at 25 °C for 15 min. 100 juL of the mixture was added to 
the 3 5 amino modified template (50 [iL, typically 300 ^iM) in 100 mM HEPES buffer (pH 8.5; 
850 }iL) and was incubated at 37 °C for 16 h. The reaction mixture was desalted by NAP-5 size 
exclusion column and further purified by analytical scale HPLC (8-30% MeCN/0.1 M TEAA 
gradient). The desired oligonucleotide products were characterized by MALDI-TOF mass 
spectrometry. 

II. Results and Conclusions 

(i) DNA~templated Transformation from an Azide to an Amine 

[00121] A variety of organic azides linked to the 5' termini of 30-mer DNA oligonucleotide 
templates were reacted with a triphenylphosphine conjugated to the 3' terminus of a 
complementary DNA 10-mer (see, FIG. 5A). DNA-templated azide-to-amine functional group 
transformations were performed by mixing a 30-mer 5' azide-linked template (12 pmol) and 10- 
mer 3' triphenylphosphine-linked reagent (24 pmol) in a total volume of 200 juL of 100 mM 
CAPS buffer (pH 10) containing 500 mM NaCl at 25 °C for 16 h. For substrates 4 and 5, 1 M 
NaCl and the addition of 0.5 mM DTT to inhibit phosphine oxidation was found to increase 
yields. Representative reaction conditions included for 1-7, 60 nM azide, 120 nM phosphine, 
0.1 M CAPS pH 10, 0.5 M NaCl; for 8-11, as above, except 0.1 M MES pH 6.0, 1 M NaCl; and 
for 12, as above, except 0.1 M MOPS pH 7.5, 1 M NaCl. 

[00122] Unlike DNA-templated coupling reactions, the azide-to-amine transformations could 
not be monitored directly by denaturing polyacrylamide gel electrophoresis (PAGE) because the 
starting materials and products had similar molecular weights. To assay the progress of these 
reactions, the putative amine products were captured with 20-mer-linked carboxylic acids in the 
presence of a carbodiimide, or with 20-mer-linked aldehydes in the presence of NaBH 3 CN. 
These secondary reagents or capture reagents displaced the 10-mer linked phosphine oxide and 
efficiently coupled with primary amines, but not with azides. The resulting amide or secondary 
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amine products gained the molecular weight of the 20-mer and could easily be distinguished 
from starting azides by PAGE. 

[00123] In order to capture amine products derived from substrates 1-6 a 20-mer 3' carboxylic 
acid-linked reagent (24 pmol) was added to the reaction mixture with EDC (30 mM) and sulfo- 
NHS (15 mM) in MES buffer (pH 6.5). In order to capture amine products derived from 
substrate 7, the product was captured with a 20-mer 3 1 aldehyde-linked reagent in the presence of 
NaBH 3 CN (3 mM) in MES buffer (pH 6.5). Following product capture, the DNA-linked species 
were precipitated with NaOAc (pH 5), ethanol, and glycogen. 

[00124] The resulting pellets were dissolved in denaturing gel-loading buffer and were 
subjected to denaturing PAGE analysis. Unless specified, denaturing PAGE analysis was 
performed using 15% polyacrylamide gel (TBE-urea). 

[00125] Reaction yields were quantitated by ethidium bromide staining of the gels, UV 
visualization and CCD-based densitometry of product and template bands. Yield calculations 
assumed that templates and products in denaturing gels stained with equal intensity per base. In 
cases where products were partially double-stranded during quantitation, changes in staining 
intensity may result in higher apparent yields. 

[00126] Typical results obtained by denaturing PAGE analysis are shown in FIG, 6. FIG. 6A 
shows denaturing PAGE analysis of a DNA-templated azide-to-amine transformation for azide 3 
in FIG, SB. FIG. 6B shows denaturing PAGE analysis of a DNA-templated azide-to-amine 
transformation for azide 7 in FIG. 5B. 

[00127] For the seven azides tested (substrates 1-7 in FIG. 5B), DNA-templated azide 
reduction proceeded efficiently at pH 10. The actual yields of the reaction products are 
summarized in FIG. SB. In each case, control reactions in which the phosphine was linked to a 
non-complementary, mismatched oligonucleotide did not generate significant amide or 
secondary amine products, indicating that these DNA-templated azide-to-amine transformations 
proceed sequence-specifically. 

(ii) DNA-templated Transformation from an Azide to Carboxylic Acid or Thiol 

[00128] The scope of the reactions was further extended to effect azide-to carboxylic acid and 
azide-to-thiol functional group transformations (see, FIG. 5 A). In both cases, azide reduction 
induced spontaneous fragmentation to unmask carboxylic acid or thiol groups. To assess the 
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efficiency of these reactions, DNA-linked amines were used to capture carboxylic acids 
(products resulting from substrates 8-10) in the presence of a carbodiimide, while DNA-linked 
alkyl bromides were used to capture thiol products (products resulting from substrates 11 and 
12). 

[00129] DNA-templated azide-to-carboxylic acid transformations were performed like the 
azide-to-amine transformations, except that the buffer contained 0.1 M MES pH 6.0 and 1 M 
NaCl. To capture carboxylic acid products, 20-mer 3 5 amine-linked reagent was added to the 
reaction mixture with EDC (30 mM) and sulfo-NHS (15 mM) in MES buffer (pH 6.5). Typical 
results from denaturing PAGE are shown in FIG. 6C (represented is the case using reagent 8 
from FIG. 5B). 

[00130] DNA-templated azide-to-thiol transformations were performed as above, except that 
the buffer contained either 0.1 M MES pH 6.0 (for substrate 11) or MOPS pH 7.5 (for substrate 
12) and 1 M NaCl. To capture thiol products, 20-mer 3' alkyl bromide-linked reagent was added 
to the reaction mixture and incubated at 37 °C for 6 h. Typical results from denaturing PAGE 
are shown in FIG. 6D (represented is the case using reagent 11 from FIG. 5B). 

[00131] For the carboxylic acid-to-amine and thiol-to-amine transformations, denaturing 
PAGE analysis indicated that DNA-templated functional group transformations to unmask 
carboxylic acid and thiol groups (substrates 8-12 in FIG. 5B) also proceeded efficiently and 
sequence-specifically. 

Example 2. DNA-Tem plated Transformation of Azides into Primary Amines, Carboxylic 
Acids, and Thiols (Characterization by Mass Spectrometry) 

[00132] This Example is similar to Example 1 except the reaction products were characterized 
by mass spectrometry rather than PAGE. To facilitate this a smaller template and different 
capture system were used under the same or similar conditions. 

I. Materials and Methods 

(i) Synthesis of Azido Acids 

[00133] The azido substrates for the synthesis of compounds 1-12 shown in FIG. 5B were 
prepared as described in Example 1. 

(ii) Preparation of Functionalized Oligonucleotides 
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[00134] The oligonucleotides used in this Example were prepared in a manner similar to 
Example 1 with the following changes. 

(a) Template Oligonucleotides 

[00135] The template oligonucleotides were prepared as described in Example 1 except that 
5 rather than using a 30-mer template, the following 10-mer template was used: 5'-NH 2 (C 2 H 4 0) 2 - 
PO3H-GGT ACG AAT T-OP0 3 H-CH(CH 2 OH)CH2(OC2H 4 )4CH 2 NHCO-biotin (SEQ ID NO: 
5). 

(b) Transfer Units 

[00136] The triphenylphosphine-Iinked reagent was prepared as described in Example 1 . 

1 0 (Hi) Mass Spectroscopic Analysis 

[00137] MALDI-TOF mass spectrometry was performed on an Applied Biosystems Voyager- 
DE Pro Biospectrometry Workstation and processed with Voyager Data Explorer software. A 
mixture of nine parts hydroxypicolinic acid (HP A, 50 mg/mL in 50% MeCN/H 2 0) and one part 
ammonium citrate (50 mg/mL in H 2 0) was used as the matrix in all experiments. 

15 IL Results and Conclusions 

[00138] Complementary DNA-linked phosphine reagent (24 pmol) was added to a solution of 
10-mer 5 ? -azide-linked, 3'-biotinyIated template (12 pmol) in 100 mM CAPS buffer (pH 10) 
with 500 mM NaCl. The mixture was agitated at 25° C for 0.5 h then at 37 °C for 12 h. The 
biotinylated products and unreacted templates were purified by treating the reaction mixture with 
20 streptavidin-linked magnetic particles (Roche) and eluted following the manufacturer's protocol. 
DNA in the eluant was precipitated with ethanol and glycogen. Substrates 11-12 were subjected 
directly to subsequent mass spectroscopic analysis. Samples for MALDI-TOF analysis were 
prepared by desalting the pellets dissolved in the matrix solution using a ZipTip CI 8 column 
(Millipore). 

25 [00139] The resulting iminophosphoranes were identified by MALDI-TOF mass spectrometry 
and found to be unexpectedly stable to hydrolysis especially under acidic conditions, presumably 
due to formation of a stable HC1 salt (Shalev, et ah (1996) J. Org. Chem. 61 : 1689-1701). 
Treatment of template-linked azides with DNA-linked phosphine in pH 10 buffer at 25 °C for 
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0.5 h followed by 37 °C for 12 h, however, resulted in quantitative iminophosphorane hydrolysis 
to generate the corresponding primary amines. 

[00140] The results from the MALDI-TOF analysis are summarized in Table 1 where 
reagents 1-12 are denoted as in FIG. 5B. Due to the instability under the conditions for MALDI- 
TOF experiments, thiol-linked products (11-12) were captured as alkyl thioether adducts by 
treating with iodoacetamide (5 mM) following the DNA-templated Staudinger reaction (the 
MALDI-TOF data for 11-12 in Table 1 are of captured thioether adducts). 



Table 1 



"Reagents .• .;•«;;% 
(see, FIG. 5B) 


Expected Mass .• . '•' - t> , f , 


Observed Mass 


1 


5866.05 


5868.02 ± 9 


2 


5922.16 


5926.50 ± 9 


3 


5942.15 


5945.18 ±9 


4 


5937.13 


5940.98 ± 9 


5 


5934.17 


5934.61 ±9 


6 


5942.15 


5944.46 ± 9 


7 


5928.12 


5930.91 ± 9 


8 


5963.16 


5968.42 ± 9 


9 


5909.07 


5915.97 ±9 


10 


6032.23 


6038.24 ± 9 


11 


5941.16 


5944.61 ±9 


12 


5955.19 


5957.01 ± 9 



[00141] Mass spectrometric analysis of azide reduction reactions was consistent in each cas 
with the formation of expected primary amine products. A representative spectrum is shown i 
FIG. 7. 
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[00142] Based on the mass spectroscopic analyses set forth in Table 1, the sequence specific 
azide-to-amine, azide-to-carboxylic acid, and azide-to-thiol transformations all produced the 
appropriate products. 

Example 3. Transformations of Amine-Linked Templates Using Small Molecule Reagents 

5 [00143] To further explore the ability of DNA-templated functional group transformations to 
enable non-DNA-linked reagents to participate in sequence-programmed synthesis, four DNA 
templates (templates 13-16, FIG. 8) were prepared, each containing a different azide at the 5 5 
terminus, one of four unique six-base codons, and a biotin group at the 3 f terminus to facilitate 
template manipulation and purification. The azide-linked templates then were chemically 

10 converted into amine-linked templates by exposure to TCEP-HC1. The resulting amines then 
were reacted with free reagents to determine whether the conversion of the amine intermediate 
into a final product was possible. In particular, dansyl chloride (21), ethyl chloroformate (22), 4- 
methoxy phenyl isocyanate (23) and 6-morpholino pyridinyl 3-isothiocyanate (24) were all 
chosen as amine-reactive agents as they cannot easily be attached to DNA due to their structure 

15 or their reactivity with water. Simplified reaction schemes showing the starting reagents and 
theoretical end products are shown in FIG! 9. 

I. Materials and Methods 

[00144] Coding sequences for the templates were designed by computational screening to (/) 
ensure that at least 6 non-complementary base pairs existed between any two different codons, 
20 (ii) maintain a constant %GC per codon in order to minimize differences in melting temperatures 
between reagents, and (Hi) vary in mass such that the molecular weights of the 16 theoretical 
small-molecule coupling products are distinct and identifiable by MALDI-TOF mass 
spectrometry. 

[00145] Each of the templates used in the schemes shown in FIG. 8 (templates 13-16) 

25 contained 5 ' NH 2 (C2H40)2-P03H-TT-(codon)-GTA n -OP0 3 H- 

CH(CH 2 OH)CH2(OC2H 4 )4CH 2 NHCO-biotin. The codons used for each template were as 
follows: the codon for template 13 was GTG CAA CGT CAT, n = 0 (SEQ ID NO: 6); the codon 
for template 14 was CCT AGT CGT CAT, n = 3 (SEQ ID NO: 7); the codon for template 15 was 
TAA GCC CGT CAT, n = 2 (SEQ ID NO: 8); and the codon for template 16: AGC TTG CGT 

30 CAT, n = 1 (SEQ ID NO: 9). 
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[00146] The azide-containing templates 13-16, were prepared as described in Example 1 (see 
templates 1, 3, 4, and 2, respectively). 

[00147] The azide groups in templates 13-16 were chemically transformed into amine groups 
by exposure to TCEP-HCI. Briefly, amine-linked templates then were prepared by treating the 
5 azide-linked templates (templates 13-16) with 5 mM TCEP-HCI in 100 mM MOPS buffer (pH 
7.5) at 25 °C for 3 h. The resulting templates were purified by HPLC. Thereafter, the resulting 
templates (amine intermediates) were reacted with soluble reagents to determine whether 
functional group transformations were possible. 

II. Results and Conclusions 

10 [00148] Once the amine-linked templates 13-16 were created, they were then exposed to 
soluble reagents to see whether functional group transformations were possible. Each of the 
transformations are discussed in detail below. 

[00149] The amine-linked template 13 (400 pmol) in 100 ]iL of 100 mM aqueous NaHC0 3 
(pH 9.0) was mixed with 20 mM dansyl chloride 21 in 100 |uL DMF and agitated at 37 °C for 1 
15 h. The reaction mixture was diluted in 200 \xL 0. 1 M TEAA and passed through a NAP-5 size 
exclusion column. The eluant in 1 mL 0.1 M TEAA was analyzed by analytical scale reverse 
phase HPLC (8-30 % MeCN/ 0.1 M TEAA gradient). Product yield was calculated based on the 
integrated peak areas (based on UV absorbance at 260 nm) of the starting material, the product, 
and any side products. A representative chromatogram is shown in FIG. 10A. 

20 [00150] Amine-linked template 14 (400 pmol) in 100 juL of 200 mM aqueous NaHC0 3 (pH 
9.0) was mixed with 40 mM ethyl chloroformate 22 in 100 |uL DMF and agitated at 37 °C for 1 
h. The reaction was quenched by addition of glycogen in NaOAc buffer (pH 5.0) followed by 
ethanol precipitation. The pellet was dissolved in 0.1 M TEAA and analyzed by analytical scale 
reverse phase HPLC (8-30 % MeCN/ 0.1 M TEAA gradient). A representative chromatogram is 

25 shown in FIG. 10B. 

[00151] Amine-linked template 15 (400 pmol) in 100 juL of 500 mM aqueous triethylamine 
(pH 10) was mixed with 20 mM 4-methoxyphenylisocyanate 23 in 100 juL DMF and agitated at 
37 °C for 1 h. The reaction mixture was quenched and analyzed as described above. A 
representative chromatogram is shown in FIG. IOC. 
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[00152] Amine-linked template 16 (400 pmol) in 100 \jlL of 500 mM aqueous triethylamine 
(pH 10) was mixed with 20 mM and was allowed to react with 20 mM 6-morpholino 
isothiocyanate 24 in 100 |uL DMF and agitated at 37 °C for 1 h. The reaction mixture was 
quenched and analyzed as described above. A representative chromatogram is shown in FIG. 
5 10D. 

[00153] When soluble reagents (free reactants) 21, 22, 23, or 24 were added in excess (10 or 
20 mM final concentration) in DMF to a template-linked primary amine under basic conditions 
(pH 9-10), the corresponding sulfonamide, carbamate, urea, or thiourea was efficiently 
generated (70% yield for 21, > 86% for 22, 23 and 24). These results demonstrated that it was 
10 possible to convert a template coupled to an amide into a sulfonamide, a carbomate, urea or 
thiourea using free reactants. 

Example 4, Sequence Specific Transformation of Four Azide-linked Templates Using 
Small Molecule Reagents 

[00154] This Example demonstrates that it is possible to perform a sequence specific 
15 transformation of template bound reactants to generate reaction intermediates, which can then be 
reacted with free reactants to produce reaction products. In particular, a single-solution mixture 
of azide linked templates were sequence-specifically transformed into amine intermediates. The 
amine intermediates were then sequence-specifically modified into sulfonamide, carbamate, 
urea, and thiourea products using sulfonyl chloride, chloroformate, isocyanate, and 
20 isothiocyanate reactants not linked to DNA. 

I. Materials and Methods 

(i) Preparation of Functionalized Oligonucleotides 

(a) Template Oligonucleotides 

[00155] The templates 13-16 were prepared as described in Example 3. 

25 (b) Transfer Units 

[00156] Each of the following triphenylphosphine-linked oligonucleotides were prepared as 
described in Example 1. 

[00157] Oligonucleotide 17 (FIG. 9) had the structure 5 5 CGT TGC ACA A- OPO3H- 
CH 2 CH(CH 2 OH)(CH 2 )4NHCOC6H 4 PPh 2 (SEQ ID NO: 10). Oligonucleotide 18 (FIG. 9) had 
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the structure 5' CGA CTA GGA A- OP0 3 H-CH 2 CH(CH 2 OH)(CH 2 )4NHCOC 6 H4PPh 2 (SEQ ID 
NO: 1 1). Oligonucleotide 19 (FIG. 9) had the structure 5' CGG GCT TAA A- OP0 3 H- 
CH 2 CH(CH 2 OH)(CH 2 )4NHCOC6H 4 PPh2 (SEQ ID NO: 12). Oligonucleotide 20 (FIG. 9) had 
the structure 5' CGC AAG CTA A- OP0 3 H-CH 2 CH(CH 2 OH)(CH 2 )4NHCOC6H 4 PPh 2 (SEQ ID 
5 NO: 13). 

IL Results and Conclusions 

[00158] A mixture of templates 13-16 was combined with sequence specific reactant 17 and 
then free reactant 21. The resulting solution was similarly combined with sequence specific 
reactant 18 followed by free reactant 22; sequence specific reactant 19 followed by free reactant 
10 23; and sequence specific reactant 20 followed by free reactant 24. 

[00159] More specifically, 3' triphenylphosphine-linked oligonucleotide 17 (8 equiv.) was 
added to a single solution mixture of the four 5' azide-linked templates (templates 13-16, 100 
nM for each template) in 100 mM CAPS buffer (pH 10) and 500 mM NaCl to effect azide-to- 
amine transformation. The mixture was incubated at 25 °C for 0.5 h then 37 °C for 12 h. The 

15 oligonucleotides were precipitated by the addition of glycogen in NaOAc buffer (pH 5.0) and 
ethanol. The pellet was dissolved in 100 juL of 100 mM NaHC0 3 and was allowed to react with 
dansyl chloride 21 in 100 juL of DMF (20 mM) at 37 °C for 1 h. The reaction mixture was 
desalted by ethanol precipitation. If the DNA-templated azide-to-amine transformation 
proceeded sequence-specifically, only the amine arising from template 13 should react with 

20 sequence specific reactant 21 to generate sulfonamide 25, while templates 14-16 should remain 
unaltered (see, FIG. 8). Excess sulfonyl chloride was removed upon ethanol precipitation, and 
any unreacted amines were removed using N-hydroxysuccinimidyl (NHS) ester-linked resin. 

[00160] The DNA-templated azide-to-amine transformation described above then was 
repeated using phosphine-linked oligonucleotide 18. The pellet was dissolved in 100 |uL of 200 
25 mM NaHC0 3 and was allowed to react with ethyl chloroformate 22 in 100 juL of DMF (40 mM) 
at 37 °C for 1 h. The reaction mixture was desalted by ethanol precipitation and dried. 

[00161] The DNA-templated azide-to-amine transformation described above then was 
repeated using phosphine-linked oligonucleotide 19. The pellet was dissolved in 100 juL of 500 
mM aqueous triethylamine solution and was allowed to react with 4-methoxyphenylisocyanate 
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23 in 100 |uL of DMF (20 mM) at 37 °C for 1 h. The reaction mixture was desalted by ethanol 
precipitation and dried. 

[00162] The DNA-templated azide-to -amine transformation described above then was 
repeated using phosphine-linked oligonucleotide 20. The pellet was dissolved in 100 |liL of 500 
5 mM aqueous triethylamine solution and was allowed to react with 6-morpholino-3- 

pyridinylisothiocyanate 24 in 100 |uL of DMF (20 mM) at 37 °C for 1 h. The reaction mixture 
was desalted by ethanol precipitation and dried. The pellet was dissolved in 100 mM MES 
buffer (pH 6), first treated with TCEP-HC1 (5 mM) at 25 °C for 2 h then with NHS activated 
resin (Amersham Biosciences; 5 |liL resin solution for 100 pmol template) for another 2 h. The 
10 resin was removed by filtration and washed three times with 0.1 M TEAA. 

[00163] The final mixture of products was purified by capturing template-linked biotin groups 
with streptavidin linked to magnetic particles. The captured oligonucleotides were eluted from 
the particles following the manufacturer's protocol. The DNA in the eluant was precipitated 
with NaOAc (pH 5.0), glycogen, and ethanol. DNA recovery was determined spectrometrically 

15 by monitoring UV absorbance for the starting material pool and the final product pool at 260 nm. 
The concentration for a mixture containing equal amounts of products 25-28 (see, FIG. 8) with a 
UV absorbance of 1 .0 at 260 nm was estimated to be 5.5 jjM. Samples for MALDI-TOF 
analysis were prepared as described in Example 2. FIG. 11A and FIG. 11B show 
representative spectra of starting materials (templates 13, 14, 15 and 16) and products (products 

20 25, 26, 27, 28), respectively. 

[00164] MALDI-TOF mass spectrometry revealed that the final product mixture contained 
predominantly the four sequence-programmed products (sulfonamide 25, carbamate 26, urea 27, 
and thiourea 28). None of the 12 possible undesired cross-products were observed. 

[00165] UV spectrometry analysis indicated that the final product mixture was generated in 
25 51% overall yield for the four consecutive DNA-templated reduction and small-molecule 
coupling sequences. These results establish that DNA-templated functional group 
transformations enable non-DNA-linked small molecules to participate in sequence-programmed 
reactions. The efficiency of this process also highlights the value of molecular biology-based 
purification and washing strategies made possible when performing organic synthesis on this 
30 minute (sub-nmol) scale. 
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[00166] Taken together, the DNA-templated functional group transformation described in this 
Example expands the synthetic capabilities of nucleic acid-templated synthesis by addressing the 
need for reagents to be tethered to oligonucleotides. When the linkage of reagents to 
oligonucleotides is not possible or is inconvenient, these transformations allow such reagents to 
nevertheless contribute to small molecule syntheses while preserving the correspondence 
between an oligonucleotide sequence and a product structure. 

INCORPORATION BY REFERENCE 

The entire disclosure of each of the patent documents and scientific articles referred to 
herein is incorporated by reference for all purposes. 

EQUIVALENTS 

The invention may be embodied in other specific forms without departing from the spirit 
or essential characteristics thereof. The foregoing embodiments are therefore to be considered in 
all respects illustrative rather than limiting on the invention described herein. Scope of the 
invention is thus indicated by the appended claims rather than by the foregoing description, and 
all changes that come within the meaning and range of equivalency of the claims are intended to 
be embraced therein. 
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WHAT IS CLAIMED IS: 



1 1 . A method of synthesizing a reaction product, the method comprising the steps of: 

2 (a) providing a mixture comprising a first reactive unit and a second reactive unit 

3 under conditions to induce a reaction between the first and second reactive units to 

4 produce a reaction intermediate; 

5 (b) providing an oligonucleotide comprising an identifying sequence covalently 

6 attached to the reaction intermediate; and 

7 (c) combining the reaction intermediate with a free reactant selectively reactive with 

8 the reaction intermediate, thereby synthesizing a reaction product linked to the 

9 identifying sequence, wherein the free reactant is more reactive with the reaction 
10 intermediate than with at least one of the reactive units in the starting mixture. 

1 2. The method of claim 1, wherein the identifying sequence is attached to the first reactive 

2 unit prior to the reaction that produces the reaction intermediate and remains linked to the 

3 reaction intermediate. 

1 3. The method of claim 2, wherein the second reactive unit is attached to an oligonucleotide 

2 sequence complementary to the identifying sequence. 

1 4. The method of claim 3, wherein step (a) comprises hybridizing the identifying sequence 

2 with the sequence complementary to the identifying sequence, thereby bringing the first 

3 and second reactive units into reactive proximity. 

1 5. The method of claim 1, wherein step (b) comprises enzymatically attaching the 

2 identifying sequence to the reaction intermediate after formation of the reaction 

3 intermediate. 

1 6. The method of claim 1, wherein the free reactant is at least five times more reactive with 

2 the reaction intermediate than with either of the reactive units in the starting mixture. 

1 7. The method of claim 1, wherein the free reactant is at least fifty times more reactive with 

2 the reaction intermediate than with at least one of the reactive units in the starting 

3 mixture. 
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1 


8. 


The method of claim 1, wherein the free reactant is at least one thousand times more 


2 




reactive with the reaction intermediate than with at least one of the reactive units in the 


3 




starting mixture. 


1 


9. 


The method of claim 1, wherein the reaction product is synthesized with a yield greater 


2 




than or equal to 50%. 


1 


10. 


The method of claim 9, wherein the yield is greater than or equal to 75%. 


1 


11. 


The method of claim 10, wherein the yield is greater than or equal to 85%. 


1 


12. 


The method of claim 1 1, wherein the yield is greater than or equal to 98%. 


1 


13. 


A method of synthesizing a reaction product by nucleic acid-templated synthesis, the 


2 




method comprising the steps of: 


3 




(a) providing a mixture comprising (i) a first reactive unit attached to a first 


4 




oligonucleotide comprising a codon sequence, and (ii) a second reactive unit attached to a 


5 




second oligonucleotide comprising an anti-codon sequence complementary to the codon 


6 




sequence; 


7 




(b) annealing the codon sequence of the first oligonucleotide with the anti-codon 


8 




sequence of the second oligonucleotide to induce a reaction between the first and second 


9 




reactive units to form a reaction intermediate attached at least to the first oligonucleotide; 


1 ft 






1 1 

1 1 




(c^ combining the reaction intermediate with a free reactant selectively reactive with 


12 




the reaction intermediate, thereby synthesizing a reaction product linked to the first 


13 




oligonucleotide, wherein the free reactant is more reactive with the reaction intermediate 


14 




than with either of the reactive units in the starting mixture. 


1 


14. 


The method of claim 13, wherein the free reactant is at least five times more reactive with 


2 




the reaction intermediate than with either of the reactive units in the starting mixture. 


1 


15. 


The method of claim 13, wherein the free reactant is at least fifty times more reactive 


2 




with the reaction intermediate than with at least one of the reactive units in the starting 


3 




mixture. 
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1 16. The method of claim 13, wherein the free reactant is at least one thousand times more 

2 reactive with the reaction intermediate than with at least one of the reactive units in the 

3 starting mixture. 

1 17. The method of claim 13, wherein the reaction product is synthesized with a yield greater 

2 than or equal to 50%. 

1 18. The method of claim 17, wherein the yield is greater than or equal to 75%. 

1 19. The method of claim 1 8, wherein the yield is greater than or equal to 85%. 

1 20. The method of claim 19, wherein the yield is greater than or equal to 98%. 

1 21. A method of synthesizing a reaction product, the method comprising the steps of: 

2 (a) providing a mixture of a population of first reactive units and a second reactive 

3 unit under conditions that induce a reaction between at least one of the first reactive units 

4 and the second reactive unit, thereby to form a reaction intermediate co-existing with the 

5 population of first reactive units; 

6 (b) providing an oligonucleotide comprising an identifying sequence attached to the 

7 reaction intermediate; 

8 (c) combining the reaction intermediate co-existing with the first reactive units with a 

9 free reactant capable of selectively reacting with the reaction intermediate, thereby 

10 synthesizing a reaction product attached to an identifying sequence, the reaction product 

1 1 co-existing with the population of first reactive units, wherein the free reactant is more 

12 reactive with the reaction intermediate than with at least one of the reactive units in the 

1 3 starting mixture. 

1 22. The method of claim 21, wherein the identifying sequence is attached to the at least one 

2 of the first reactive units prior to and during the reaction to produce the reaction 

3 intermediate and remains attached to the reaction intermediate. 

1 23. The method of claim 22, wherein the second reactive unit is linked to a sequence 

2 complementary to the identifying sequence. 
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1 24. The method of claim 23, wherein step (a) comprises hybridizing the identifying sequence 

2 with the sequence complementary to the identifying sequence, thereby bringing the at 

3 least one of the first reactive units and the second reactive unit into reactive proximity. 

1 25. The method of claim 21, wherein step (b) comprises enzymatically attaching the 

2 identifying sequence to the reaction intermediate after formation of the reaction 

3 intermediate. 

1 26. The method of claim 21, wherein the free reactant is at least five times more reactive with 

2 the reaction intermediate than with at least one of the reactive units in the starting 

3 mixture. 

1 27. The method of claim 21, wherein the free reactant is at least fifty times more reactive 

2 with the reaction intermediate than with at least one of the reactive units in the starting 

3 mixture. 

1 28. The method of claim 21, wherein the free reactant is at least one thousand times more 

2 reactive with the reaction intermediate than with at least one of the reactive units in the 

3 starting mixture. 

1 29. The method of claim 21, wherein the reaction product is synthesized with a yield greater 

2 than or equal to 50%. 

1 30. The method of claim 29, wherein the yield is greater than or equal to 75%. 

1 31. The method of claim 30, wherein the yield is greater than or equal to 85%. 

1 32. The method of claim 31, wherein the yield is greater than or equal to 98%. 

1 33. A method of synthesizing a reaction product by nucleic acid-templated synthesis, the 

2 method comprising the steps of: 

3 (a) providing a mixture comprising a plurality of first reactive units attached to first 

4 oligonucleotides comprising codon sequences, wherein the oligonucleotide sequence is 

5 indicative of the first reactive unit attached thereto; 
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6 (b) providing a second reactive unit attached to a second oligonucleotide comprising 

7 an anti-codon sequence complementary to the codon sequence of at least one first 

8 reactive unit; 

9 (c) annealing the codon sequence of at least one of the first oligonucleotides with the 

10 anti-codon sequence of the second oligonucleotide to induce a reaction between the first 

1 1 and second reactive units to form a first reaction intermediate attached at least to a first 

12 oligonucleotide; and 

13 (d) combining the first reaction intermediate with a free reactant selectively reactive 

14 with the first reaction intermediate, thereby synthesizing a first reaction product attached 

15 to the first oligonucleotide, wherein the free reactant is more reactive with the first 

1 6 reaction intermediate than with at least one of the reactive units in the mixture. 

1 34. The method of claim 33, further comprising the steps of: 

2 (e) providing a third reactive unit attached to a third oligonucleotide comprising an 

3 anti-codon sequence complementary to the codon sequence of at least one first reactive 

4 unit; 

5 (f) annealing the codon sequence of at least one of the first oligonucleotides with the 

6 anti-codon sequence of the third oligonucleotide to induce a reaction between the first 

7 and third reactive units to form a second reaction intermediate attached at least to a first 

8 oligonucleotide; and 

9 (g) combining the second reaction intermediate with a free reactant selectively 

10 reactive with the second reaction intermediate, thereby synthesizing a second reaction 

1 1 product attached to the identifying sequence, wherein the free reactant is more reactive 

12 with the second reaction intermediate than with at least one of the reactive units in the 

13 mixture. 

1 35. An in vitro method of performing nucleic acid-templated synthesis, the method 

2 comprising the steps of: 

3 (a) providing (i) a template comprising a first reactive unit covalently attached to a 

4 first oligonucleotide defining a codon sequence, and (ii) a transfer unit comprising a 
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5 second reactive unit covalently attached to a second oligonucleotide defining an anti- 

6 codon sequence complementary to the codon sequence of the template; 

7 (b) annealing the codon and anti-codon sequences to bring the first reactive unit and 

8 the second reactive unit into reactive proximity so that the first and second reactive units 

9 react with one another to produce a reaction intermediate; and 

1° (c) contacting the reaction intermediate with a free reactant, which chemically reacts 

1 1 with the reaction intermediate to produce a reaction product, wherein the first 

12 oligonucleotide remains attached to the reaction product. 

1 36. The method of claim 35, wherein the first reactive unit is a small molecule scaffold. 

1 37. The method of claim 35, wherein, in step (b), a functional group of the first reactive unit 

2 is transformed into a different chemical moiety in the reaction intermediate. 

1 38. The method of claim 36, wherein the small molecule scaffold contains a protected 

2 functional group. 

1 39. The of claim 38, wherein, in step (b), the functional group is deprotected to produce a 

2 reaction intermediate where the small molecule scaffold contains a deprotected functional 

3 group. 

1 40. The method of claim 35, wherein the reaction product is not a nucleic acid. 

1 41 . The method of claim 35, wherein the reaction product is not a nucleotide or a nucleotide 

2 analog. 

1 42. The method of claim 35, wherein, in step (b), the first and second reactive units react 

2 with one another to produce the reaction intermediate without the assistance of a 

3 ribosome. 

1 43. The method of claim 35 further comprising the step of selecting the reaction product 

2 attached to the first oligonucleotide. 

1 44. The method of claim 43 further comprising the step of amplifying the first 

2 oligonucleotide. 
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1 45. The method of claim 44 comprising the additional step of determining the sequence of 

2 the first oligonucleotide attached to the reaction product so as to determine the identity or 

3 synthetic history of the reaction product. 
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SEQUENCE LISTING 

<110> President and Fellows of Harvard College 
Liu, David R. 
Sakurai , Kaori 

<120> Free Reactant Use in Nucleic Acid-Tempi ated synthesis 

<130> LS1-003PC 

<150> US 60/646,584 
<151> 2005-01-21 

<160> 13 

<170> Patentln version 3.3 

<210> 1 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> 30-mer Oligonucleotide Template (for characterization by PAGE) 
<400> 1 

ggtacgaatt gcactcggga aatccacctt 30 

<210> 2 

<211> 10 

<212> DNA 

<213> Artificial 

<220> 

<223> 10-mer oligonucleotide Linked to Phosphine 



10 



<400> 2 
aattcgtacc 

<210> 3 

<2U> 10 

<212> DNA 

<213> Artificial 

<220> 

<223> 10-mer oligonucleotide with three-base mismatch 

<400> 3 

aatacatccc 1° 

<210> 4 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> 20-mer oligonucleotide Capture Reagent 



<400> 4 

tcccgagtgc aattcgtacc 



20 
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<210> 5 

<211> 10 

<212> DNA 

<213> Artificial - 

<220> 

<223> 10-mer oligonucleotide Template 
Spetroscopy) 

<400> 5 
ggtacgaatt 



<210> 6 

<211> 14 

<212> DNA 

<213> Artificial 

<220> 

<223> Template 13 oligonucleotide 

<400> 6 
ttgtgcaacg teat 



<210> 7 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Template 14 oligonucleotide 

<400> 7 

ttcctagtcg tcatgtagta gta 



<210> 8 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Template 15 oligonucleotide 

<400> 8 

tttaagcccg tcatgtagta 



<210> 9 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Template 16 Oligonucleotide 

<400> 9 

ttagcttgcg tcatgta 
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(for characterization by Mass 

10 
14 
23 
20 



<210> 10 
<211> 10 
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<212> DNA 

<213> Artificial 

<220> 

<223> Reactant 17 oligonucleotide 
<400> 10 

cgttgcacaa xu 

<210> 11 

<211> 10 

<212> DNA 

<213> Artificial 

<220> 

<223> Reactant 18 oligonucleotide 
<400> 11 

cgactaggaa xu 

<210> 12 

<211> 10 

<212> DNA 

<213> Artificial 

<220> 

<223> Reactant 19 oligonucleotide 
<400> 12 

cgggcttaaa xu 

<210> 13 

<211> 10 

<212> DNA 

<213> Artificial 

<220> 

<223> Reactant 20 oligonucleotide 
<400> 13 

cgcaagctaa xu 



